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PREFACE 


Functional analysis is the study of certain topological-algebraic structures 
and of the methods by which knowledge of these structures can be applied 
to analytic problems. 

A good introductory text on this subject should include a presentation 
of its axiomatics (i.e., of the general theory of topological vector spaces), it 
should treat at least a few topics in some depth, and it should contain some 
interesting applications to other branches of mathematics. I hope that the 
present book meets these criteria. 

The subject is huge and is growing rapidly. (The bibliography in 
volume I of [4] contains 96 pages and goes only to 1957.) In order to write 
a book of moderate size, it was therefore necessary to select certain areas 
and to ignore others. I fully realize that almost any expert who looks at the 
table of contents will find that some of his or her (and my) favorite topics 
are missing, but this seems unavoidable. It was not my intention to write an 
encyclopedic treatise. I wanted to write a book that would open the way to 
further exploration. 

This 1s the reason for omitting many of the more esoteric topics that 
might have been included in the presentation of the general theory of topo- 
logical vector spaces. For instance, there is no discussion of uniform spaces, 
of Moore-Smith convergence, of nets, or of filters. The notion of complete- 
ness occurs only in the context of metric spaces. Bornological spaces are 
not mentioned, nor are barreled ones. Duality is of course presented, but 
not in its utmost generality. Integration of vector-valued functions is treated 
strictly as a tool; attention is confined to continuous integrands, with values 
in a Frechet space. 

Nevertheless, the material of Part I is fully adequate for almost all 
applications to concrete problems. And this is what ought to be stressed in 
such a course: The close interplay between the abstract and the concrete is 
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not only the most useful aspect of the whole subject but also the most 
fascinating one. 

Here are some further features of the selected material. A fairly large 
part of the general theory is presented without the assumption of local con- 
vexity. The basic properties of compact operators are derived from the 
duality theory in Banach spaces. The Krein-Milman theorem on the exis- 
tence of extreme points is used in several ways in Chapter 5. The theory of 
distributions and Fourier transforms is worked out in fair detail and is 
applied (in two very brief chapters) to two problems in partial differential 
equations, as well as to Wiener’s tauberian theorem and two of its applica- 
tions. The spectral theorem is derived from the theory of Banach algebras 
(specifically, from the Gelfand-Naimark characterization of commutative 
B*-algebras); this is perhaps not the shortest way, but it is an easy one. The 
symbolic calculus in Banach algebras is discussed in considerable detail; so 
are involutions and positive functionals. 

I assume familiarity with the theory of measure and Lebesgue integra- 
tion (including such facts as the completeness of the I?-spaces), with some 
basic properties of holomorphic functions (such as the general form of 
Cauchy’s theorem, and Runge’s theorem), and with the elementary topo- 
logical background that goes with these two analytic topics. Some other 
topological facts are briefly presented in Appendix A. Almost no algebraic 
background is needed, beyond the knowledge of what a homomorphism is. 

Historical references are gathered in Appendix B. Some of these refer 
to the original sources, and some to more recent books, papers, or exposi- 
tory articles in which further references can be found. There are, of course, 
many items that are not documented at all. In no case does the absence of a 
specific reference imply any claim to originality on my part. 

Most of the applications are in Chapters 5, 8, and 9. Some are in 
Chapter 11 and in the more than 250 exercises; many of these are supplied 
with hints. The interdependence of the chapters is indicated in the diagram 
on the following page. 

Most of the applications contained in Chapter 5 can be taken up well 
before the first four chapters are completed. It has therefore been suggested 
that it might be good pedagogy to insert them into the text earlier, as soon 
as the required theoretical background is established. However, in order 
not to interrupt the presentation of the theory in this way, I have instead 
started Chapter 5 with a short indication of the background that is needed 
for each item. This should make it easy to study the applications as early as 
possible, if so desired. 

In the first edition, a fairly large part of Chapter 10 dealt with differ- 
entiation in Banach algebras. Twenty years ago this (then recent) material 
looked interesting and promising, but it does not seem to have led any- 
where, and I have deleted it. On the other hand, I have added a few items 
which were easy to fit into the existing text: the mean ergodic theorem of 
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von Neumann, the Hille-Yosida theorem on semigroups of operators, a 
couple of fixed point theorems, Bonsall’s surprising application of the 
closed range theorem, and Lomonosov’s spectacular invariant subspace 
theorem. I have also rewritten a few sections in order to clarify certain 
details, and I have shortened and simplified some proofs. 

Most of these changes have been made in response to much- 
appreciated suggestions by numerous friends and colleagues. I especially 
want to mention Justin Peters and Ralph Raimi, who wrote detailed 
critiques of the first edition, and the Russian translator of the first edition 
who added quite a few relevant footnotes to the text. My thanks to all of 
them! 

Walter Rudin 


FUNCTIONAL ANALYSIS 


PART 


I 


GENERAL 
THEORY 


CHAPTER 


TOPOLOGICAL 
VECTOR 
SPACES 


Introduction 


1.1 Many problems that analysts study are not primarily concerned with 
a single object such as a function, a measure, or an operator, but they deal 
instead with large classes of such objects. Most of the interesting classes 
that occur in this way turn out to be vector spaces, either with real scalars 
or with complex ones. Since limit processes play a role in every analytic 
problem (explicitly or implicitly), it should be no surprise that these vector 
spaces are supplied with metrics, or at least with topologies, that bear some 
natural relation to the objects of which the spaces are made up. The sim- 
plest and most important way of doing this is to introduce a norm. The 
resulting structure (defined below) is called a normed vector space, or a 
normed linear space, or simply a normed space. 

Throughout this book, the term vector space will refer to a vector 
space over the complex field ¢ or over the real field R. For the sake of 
completeness, detailed definitions are given in Section 1.4. 


1.2 Normed spaces A vector space X is said to be a normed space if to 
every x € X there is associated a nonnegative real number ||x||, called the 
norm of x, in such a way that 
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(a) ||x + yll < |lxll + Ilyll for all x and y in X, 
(b) |lax|| = |a| ||x|| if x e X and «is a scalar, 
(c) |x| > Oifx 40. 


The word “norm” is also used to denote the function that maps x 
to ||x||. 

Every normed space may be regarded as a metric space, in which the 
distance d(x, y) between x and y is ||x — y||. The relevant properties of d are 


(i) O< d(x, y) < o for all x and y, 
(ii) d(x, y) = Oif and only if x = y, 
(iii) d(x, y) = d(y, x) for all x and y, 
(iv) d(x, z) < d(x, y) + d(y, z) for all x, y, z. 


In any metric space, the open ball with center at x and radius r is 
the set 


B(x) = {y: d(x, y) <r}. 
In particular, if X is a normed space, the sets 
B,(0) = {x: ||x|| < 1} and B,(0) = {x: ||xl| < 1} 


are the open unit ball and the closed unit ball of X, respectively. 

By declaring a subset of a metric space to be open if and only if it is a 
(possibly empty) union of open balls, a topology is obtained. (See Section 
1.5.) It is quite easy to verify that the vector space operations (addition and 
scalar multiplication) are continuous in this topology, if the metric is 
derived from a norm, as above. 

A Banach space is a normed space which is complete in the metric 
defined by its norm; this means that every Cauchy sequence is required to 
converge. 


13 Many of the best-known function spaces are Banach spaces. Let us 
mention just a few types: spaces of continuous functions on compact 
spaces; the familiar [?-spaces that occur in integration theory; Hilbert 
spaces — the closest relatives of euclidean spaces; certain spaces of differen- 
tiable functions; spaces of continuous linear mappings from one Banach 
space into another; Banach algebras. All of these will occur later on in the 
text. 

But there are also many important spaces that do not fit into this 
framework. Here are some examples: 


(a) C(Q), the space of all continuous complex functions of some open set 
Q in a euclidean space R". 
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(b) H(Q), the space of all holomorphic functions in some open set Q in the 
complex plane. 


(c) Cx, the space of all infinitely differentiable complex functions on R" 
that vanish outside some fixed compact set K with nonempty interior. 


(d) The test function spaces used in the theory of distributions, and the 
distributions themselves. 


These spaces carry natural topologies that cannot be induced by 
norms, as we shall see later. They, as well as the normed spaces, are exam- 
ples of topological vector spaces, a concept that pervades all of functional 
analysis. 

After this brief attempt at motivation, here are the detailed definitions, 
followed (in Section 1.9) by a preview of some of the results of Chapter 1. 


1.4 Vector spaces The letters R and ¢ will always denote the field of 
real numbers and the field of complex numbers, respectively. For the 
moment, let ® stand for either R or ¢. A scalar is a member of the scalar 
field ®. A vector space over ® is a set X, whose elements are called vectors, 
and in which two operations, addition and scalar multiplication, are defined, 
with the following familiar algebraic properties: 


(a) To every pair of vectors x and y corresponds a vector x + y, in sucha 
way that 


x+ty=ytx and x+(yt+z)=(x4+ y) +2; 


X contains a unique vector 0 (the zero vector or origin of X) such that 
x +0=~x for every x e X; and to each x € X corresponds a unique 
vector —x such that x + (—x) = 0. 

(b) To every pair (a, x) with a e ® and x € X corresponds a vector ax, in 
such a way that 


Ix=x, a(Bx) = (af)x, 
and such that the two distributive laws 
a(x + y)=ax + ay, (a + B)x = ax + Bx 
hold. 


The symbol 0 will of course also be used for the zero element of the 
scalar field. 

A real vector space is one for which ® = R; a complex vector space is 
one for which ® = ¢. Any statement about vector spaces in which the 
scalar field is not explicitly mentioned is to be understood to apply to both 
of these cases. 
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If X is a vector space, Ac X, Bc X,x e€ X, and Ae 9, the following 
notations will be used: 


x+A={x+a:ae A}, 

x—-Az={x—a:ae A}, 

A+B={a+b:aeA, be B}, 
AA = {ha: ae A}. 


In particular (taking 4 = —1), —A denotes the set of all additive inverses of 
members of A. 

A word of warning: With these conventions, it may happen that 2A # 
A + A (Exercise 1). 

A set Y < X is called a subspace of X if Y is itself a vector space (with 
respect to the same operations, of course). One checks easily that this 
happens if and only if 0 € Y and 


aY+PYcY 


for all scalars « and B. 
A set C c X 1s said to be convex if 


IC+(1—-)CcC OK<t<1). 


In other words, it is required that C should contain tx + (1 — t)y if x EC, 
yeC,and0 <t <1. 

A set Bc X is said to be balanced if «B < B for every a € ® with 
la| <1. 

A vector space X has dimension n (dim X =n) if X has a basis 
{u,,..., u,}. This means that every x € X has a unique representation of the 
form 


X= 0,U, t'°'+4,4u, (a; € D). 


If dim X =n for some n, X is said to have finite dimension. If X = {0}, then 
dim X = 0. 


Example. If X = ¢ (a one-dimensional vector space over the scalar 
field ¢), the balanced sets are ¢, the empty set @, and every circular 
disc (open or closed) centered at 0. If X = R? (a two-dimensional 
vector space over the scalar field R), there are many more balanced 
sets; any line segment with midpoint at (0, 0) will do. The point is 
that, in spite of the well-known and obvious identification of ¢ with 
R*, these two are entirely different as far as their vector space struc- 
ture is concerned. 


1.5 Topological spaces A topological space is a set S in which a collec- 
tion t of subsets (called open sets) has been specified, with the following 
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properties: S is open, @ is open, the intersection of any two open sets is 
open, and the union of every collection of open sets is open. Such a collec- 
tion t is called a topology on S. When clarity seems to demand it, the topo- 
logical space corresponding to the topology t will be written (S, t) rather 
than S. 

Here is some of the standard vocabulary that will be used, if S and t 
are as above. 

A set E c S is closed if and only if its complement is open. The closure 
E of E is the intersection of all closed sets that contain E. The interior E° of 
E is the union of all open sets that are subsets of E. A neighborhood of a 
point p € S is any open set that contains p. (S, t) is a Hausdorff space, and t 
is a Hausdorff topology, if distinct points of S have disjoint neighborhoods. 
A set K cS is compact if every open cover of K has a finite subcover. A 
collection t’ c t is a base for t if every member of t (that is, every open set) 
is a union of members of 1’. A collection y of neighborhoods of a point 
p € Sis a local base at p if every neighborhood of p contains a member of y. 

If ES and if o is the collection of all intersections E 7 V, with 
V et, then o is a topology on E, as is easily verified; we call this the topol- 
ogy that E inherits from S. 

If a topology t is induced by a metric d (see Section 1.2) we say that d 
and t are compatible with each other. 

A sequence {x,} in a Hausdorff space X converges to a point x € X 
(or lim,..., X, = x) if every neighborhood of x contains all but finitely many 
of the points x,,. 


1.6 Topological vector spaces Suppose t is a topology on a vector 
space X such that 


(a) every point of X is a closed set, and 
(b) the vector space operations are continuous with respect to t. 


Under these conditions, t is said to be a vector topology on X, and X 
is a topological vector space. 

Here is a more precise way of stating (a): For every x € X, the set {x} 
which has x as its only member is a closed set. 

In many texts, (a) is omitted from the definition of a topological 
vector space. Since (a) is satisfied in almost every application, and since 
most theorems of interest require (a) in their hypotheses, it seems best to 
include it in the axioms. [Theorem 1.12 will show that (a) and (b) together 
imply that t is a Hausdorff topology. ] 

To say that addition is continuous means, by definition, that the 
mapping 


(x, y)y>x+y 
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of the cartesian product X x X into X is continuous: If x; € X for i = 1, 2, 
and if V is a neighborhood of x, + x,, there should exist neighborhoods V, 
of x; such that 


V, + Vz Vz. 


Similarly, the assumption that scalar multiplication is continuous means 
that the mapping 


(a, x) > ax 


of ® x X into X is continuous: If x € X, a is a scalar, and V is a neighbor- 
hood of ax, then for some r > 0 and some neighborhood W of x we have 
BW <— V whenever | B — «| <r. 

A subset E of a topological vector space is said to be bounded if to 
every neighborhood V of 0 in X corresponds a number s > 0 such that 
E ctV foreveryt>s. 


1.7 Invariance Let X be a topological vector space. Associate to each 
ae X and to each scalar 4 # 0 the translation operator T,, and the multipli- 
cation operator M,, by the formulas 


T,(x) =a+ x, M ,(x) = Ax (x € X). 


The following simple proposition is very important: 


Proposition. 7, and M, are homeomorphisms of X onto X. 


PROOF. The vector space axioms alone imply that T, and M, are 
one-to-one, that they map X onto X, and that their inverses are T_, 
and M,,,, respectively. The assumed continuity of the vector space 
operations implies that these four mappings are continuous. Hence 
each of them is a homeomorphism (a continuous mapping whose 
inverse is also continuous). //// 


One consequence of this proposition is that every vector topology Tt is 
translation-invariant (or simply invariant, for brevity): A set E < X is open if 
and only if each of its translates a + E is open. Thus t is completely deter- 
mined by any local base. 

In the vector space context, the term local base will always mean a 
local base at 0. A local base of a topological vector space X is thus a 
collection & of neighborhoods of 0 such that every neighborhood of 0 con- 
tains a member of &. The open sets of X are then precisely those that are 
unions of translates of members of &. 
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A metric d on a vector space X will be called invariant if 


d(x + z, y+ z) = d(x, y) 


for all x, y, zin X. 


1.8 Types of topological vector spaces In the following definitions, X 
always denotes a topological vector space, with topology t. 


(a) 


(b) 
(c) 
(d) 
(e) 


X 1s locally convex if there is a local base @ whose members are 
convex. 


X is locally bounded if 0 has a bounded neighborhood. 
X is locally compact if 0 has a neighborhood whose closure is compact. 
X is metrizable if t is compatible with some metric d. 


X is an F-space if its topology t is induced by a complete invariant 
metric d. (Compare Section 1.25.) 


X is a Fréchet space if X is a locally convex F-space. 


X is normable if a norm exists on X such that the metric induced by 
the norm is compatible with t. 

Normed spaces and Banach spaces have already been defined (Section 
1.2). 

X has the Heine-Borel property if every closed and bounded subset of 
X is compact. 


The terminology of (e) and (f/f) is not universally agreed upon: In 


some texts, local convexity is omitted from the definition of a Fréchet space, 
whereas others use F-space to describe what we have called Fréchet space. 


1.9 Here is a list of some relations between these properties of a topologi- 
cal vector space X. 


(a) 
(b) 
(c) 
(d) 
(e) 


If X is locally bounded, then X has a countable local base [part (c) of 
Theorem 1.15]. 

X is metrizable if and only if X has a countable local base (Theorem 
1.24). 

X is normable if and only if X is locally convex and locally bounded 
(Theorem 1.39). 

X has finite dimension if and only if X is locally compact (Theorems 
1214222). 

If a locally bounded space X has the Heine-Borel property, then X has 
finite dimension (Theorem 1.23). 
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The spaces H(Q) and Cf mentioned in Section 1.3 are infinite- 
dimensional Fréchet spaces with the Heine-Borel property (Sections 1.45, 
1.46). They are therefore not locally bounded, hence not normable; they 
also show that the converse of (a) is false. 

On the other hand, there exist locally bounded F-spaces that are not 
locally convex (Section 1.47). 


Separation Properties 


1.10 Theorem Suppose K and C are subsets of a topological vector space 
X, K is compact, C is closed, and K ~\ C = @. Then 0 has a neighborhood V 
such that 


(K+ V)N(C+V)=@. 


Note that K + V is a union of translates x + V of V (x € K). Thus 
K + V is an open set that contains K. The theorem thus implies the exis- 
tence of disjoint open sets that contain K and C, respectively. 


PROOF. We begin with the following proposition, which will be useful 
in other contexts as well: 


If W is a neighborhood of 0 in X, then there is a neighborhood U 
of 0 which is symmetric (in the sense that U = —U) and which satisfies 
U+Uc W. 


To see this, note that 0 + 0 = 0, that addition is continuous, and 
that 0 therefore has neighborhoods V,, V, such that V, + V, < W. If 


U=V,0 Vz 0 (—V,) 9 (— V3), 


then U has the required properties. 
The proposition can now be applied to U in place of W and 
yields a new symmetric neighborhood U of 0 such that 


U+U+U+4+Uc W. 


It is clear how this can be continued. 

If K = @, then K + V = @, and the conclusion of the theorem 
is obvious. We therefore assume that K # @, and consider a point 
x € K. Since C is closed, since x is not in C, and since the topology of 
X is invariant under translations, the preceding proposition shows 
that 0 has a symmetric neighborhood V, such that x + V, + V+ V, 
does not intersect C; the symmetry of V, shows then that 


(1) (x +V,4 VJ) 0(C+V)= S. 
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Since K is compact, there are finitely many points x,,..., xX, 1n K such 
that 


Rey + Vo) ue eX, 
Put V=V,, 9 °°': 0 V,,. Then 


K+Vc U x, +V,,+V)e LU (xi + Ye, + Ve), 
i=1 i=1 


and no term in this last union intersects C + V, by (1). This completes 
the proof. iff 


Since C + V is open, it is even true that the closure of K + V does not 
intersect C + V; in particular, the closure of K + V does not intersect C. 
The following special case of this, obtained by taking K = {0}, is of con- 
siderable interest. 


1.11 Theorem If 2 is a local base for a topological vector space X, then 
every member of # contains the closure of some member of &. 


So far we have not used the assumption that every point of X is a 
closed set. We now use it and apply Theorem 1.10 to a pair of distinct 
points in place of K and C. The conclusion is that these points have disjoint 
neighborhoods. In other words, the Hausdorff separation axiom holds: 


1.12 Theorem Every topological vector space is a Hausdorff space. 


We now derive some simple properties of closures and interiors in a 


topological vector space. See Section 1.5 for the notations E and E’. 
Observe that a point p belongs to E if and only if every neighborhood of p 
intersects E. 


1.13. Theorem Let X be a topological vector space. 


(a) If Ac X then A =() (A+ V), where V runs through all neighborhoods 
of 0. 

(b) IfAcXandBcX,thnA+BcA+B. 

(c) If Y is a subspace of X, so is Y. 

(d) If C is a convex subset of X, so are C and C°. 

(e) If Bisa balanced subset of X, so is B; if also 0 € B° then B° is balanced. 

(f) If E is a bounded subset of X, so is E. 
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PROOF. (a) x € A if and only if (x + V) A A # @ for every neighbor- 
hood V of 0, and this happens if and only if x € A — V for every such 
V. Since — V is a neighborhood of 0 if and only if V is one, the proof 
is complete. 


(b) Take ae A, b € B; let W be a neighborhood of a + b. There 
are neighborhoods W, and W, of a and b such that W, + W,c W. 
There exist x € A A W, and ye Bn W,, sincea € A and be B. Then 
x + y lies in (A + B) A W, so that this intersection is not empty. Con- 
sequently,a+beEeA+B. 


(c) Suppose a and £ are scalars. By the proposition in Section 
1.7, aY=aY if a #0; if « =0, these two sets are obviously equal. 
Hence it follows from (b) that 


aY+ BY=aY +BY caY+PYcY: 


the assumption that Y is a subspace was used in the last inclusion. 

The proofs that convex sets have convex closures and that bal- 
anced sets have balanced closures are so similar to this proof of (c) 
that we shall omit them from (d) and (e). 


(d) Since C° < C and C is convex, we have 
ce +(1-1)CceocC 


if 0 <t < 1. The two sets on the left are open; hence so is their sum. 
Since every open subset of C is a subset of C°, it follows that C° is 
convex. 

(e) If O0<|a| <1, then aB° = (aB)°, since x > ax is a homeo- 
morphism. Hence «B° < aB c B, since B is balanced. But «B° is open. 
So aB° c B®. If B° contains the origin, then «B° < B° even for a = 0. 

(f) Let V be a neighborhood of 0. By Theorem 1.11, Wc V for 
some neighborhood W of 0. Since E is bounded, E < tW for all suffi- 
ciently large t. For these t, we have Ec tWc tV. //// 


1.14 Theorem Ina topological vector space X, 


(a) every neighborhood of 0 contains a balanced neighborhood of 0, and 


(b) every convex neighborhood of 0 contains a balanced convex neighbor- 
hood of 0. 


PROOF. (a) Suppose U is a neighborhood of 0 in X. Since scalar multi- 
plication is continuous, there is a 6 > 0 and there is a neighborhood 
V of Oin X such that «V < U whenever | «| < 6. Let W be the union 
of all these sets «V. Then W is a neighborhood of 0, W is balanced, 
andWc U. 
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(b) Suppose U is a convex neighborhood of 0 in X. Let 
A=() aU, where a ranges over the scalars of absolute value 1. 
Choose W as in part (a). Since W is balanced, a 'W=W when 
|a| = 1; hence W c aU. Thus W c A, which implies that the interior 
A® of A is a neighborhood of 0. Clearly A° c U. Being an intersection 
of convex sets, A is convex; hence so is A°. To prove that A® is a 
neighborhood with the desired properties, we have to show that A°® is 
balanced; for this it suffices to prove that A is balanced. Choose r and 
B so thatO<r<1,|B| = 1. Then 


rBA= (\ rBaU = () rau. 


la] =1 la] =1 


Since «U is a convex set that contains 0, we have raU c aU. Thus 
rBA < A, which completes the proof. //// 


Theorem 1.14 can be restated in terms of local bases. Let us say that a 


local base & is balanced if its members are balanced sets, and let us call Z 
convex if its members are convex sets. 


1.15 


Corollary 


(a) Every topological vector space has a balanced local base. 
(b) Every locally convex space has a balanced convex local base. 


Recall also that Theorem 1.11 holds for each of these local bases. 


Theorem Suppose V is a neighborhood of 0 in a topological vector 


Space X. 


(a) If0<r,<r,<-:: andr,—- 0asn— ©, then 


(b) 
(c) 


Aa ey. 


n=1 


Every compact subset K of X is bounded. 


If 6, >0,>-:: and 6,70 as n- oc, and if V is bounded, then the 


collection 
10, on = 1, 2, 3,355) 
is a local base for X. 
PROOF. (a) Fix x € X. Since «a— ax is a continuous mapping of the 
scalar field into X, the set of all « with ax e V is open, contains 0, 


hence contains 1/r, for all large n. Thus (1/r,)x € V, or x er, V, for 
large n. 
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(b) Let W be a balanced neighborhood of 0 such that Wc JV. 
By (a), 


Since K is compact, there are integers n, < --- < n, such that 
Kcn,Wvo-:::on,w=n,w. 

The equality holds because W is balanced. If t > n,, it follows that 

KctWctv. 


(c) Let U be a neighborhood of 0 in X. If V is bounded, there 
exists s > 0 such that V c tU for all t > s. If n is so large that sé, < 1, 
it follows that V c (1/6,)U. Hence U actually contains all but finitely 
many of the sets 6, V. //// 


Linear Mappings 
1.16 Definitions When X and Y are sets, the symbol 
f: X7Y 


will mean that fis a mapping of X into Y. If A c X and Bc Y, the image 
f(A) of A and the inverse image or preimage f ~ *(B) of B are defined by 


f(A) = {f(x): x € A}, f (B) = {x: f(x) € B}. 


Suppose now that X and Y are vector spaces over the same scalar 
field. A mapping A: X — Y 1s said to be linear if 


A(ax + By) = aAx + BAy 


for all x and y in X and all scalars « and f. Note that one often writes Ax, 
rather than A(x), when A is linear. 

Linear mappings of X into its scalar field are called linear functionals. 

For example, the multiplication operators M, of Section 1.7 are linear, 
but the translation operators T, are not, except when a = 0. 

Here are some properties of linear mappings A: X > Y whose proofs 
are so easy that we omit them; it is assumed that dc X and Bc Y: 


(a) AO=0. 
(b) If A is a subspace (or a convex set, or a balanced set) the same is true 
of A(A). 


(c) If B is a subspace (or a convex set, or a balanced set) the same is true 
of A *(B). 
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(d) In particular, the set 
A~*({0}) = {x e X: Ax = 0} = W(A) 


is a subspace of X, called the null space of A. 


We now turn to continuity properties of linear mappings. 


1.17 Theorem Let X and Y be topological vector spaces. If A: X > Y is 
linear and continuous at 0, then A is continuous. In fact, A is uniformly contin- 
uous, in the following sense: To each neighborhood W of 0 in Y corresponds a 
neighborhood V of 0 in X such that 


y—x eV implies Ay — Ax € W. 


PROOF. Once W is chosen, the continuity of A at 0 shows that 
AV c W for some neighborhood V of 0. If now y — x € V, the linear- 
ity of A shows that Ay — Ax = A(y — x)e€ W. Thus A maps the 
neighborhood x + V of x into the preassigned neighborhood Ax + W 
of Ax, which says that A is continuous at x. //// 


1.18 Theorem Let A be a linear functional on a topological vector space 
X. Assume Ax # 0 for some x € X. Then each of the following four properties 
implies the other three: 


(a) A is continuous. 

(b) The null space N(A) is closed. 

(c) WA) is not dense in X. 

(d) A is bounded in some neighborhood V of 0. 


PROOF. Since (A) = A~ ‘({0}) and {0} is a closed subset of the scalar 
field ®, (a) implies (b). By hypothesis, (A) # X. Hence (b) implies (c). 

Assume (c) holds; 1e., assume that the complement of (A) has 
nonempty interior. By Theorem 1.14, 


(1) (x+V)A V(A)= © 


for some x € X and some balanced neighborhood V of 0. Then AV is 
a balanced subset of the field ®. Thus either AV is bounded, in which 
case (d) holds, or AV = ©. In the latter case, there exists y € V such 
that Ay = —Ax, and so x + ye W(A), in contradiction to (1). Thus 
(c) implies (d). 

Finally, if (d) holds then | Ax| < M for all x in V and for some 
M < oo. Ifr>0 and if W =(r/M)V, then | Ax| <r for every x in W. 
Hence A is continuous at the origin. By Theorem 1.17, this implies (a). 


//// 
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Finite-Dimensional Spaces 


1.19 Among the simplest Banach spaces are R” and ¢", the standard 
n-dimensional vector spaces over R and ¢, respectively, normed by means 
of the usual euclidean metric: If, for example, 


DAZ hice) (z, € C) 
is a vector in ¢”, then 
Z=(2P le 
Other norms can be defined on ¢". For example, 
[zi] =|z,) +---+|z,| or |z|| = max (|z;|: 1 <i<n). 


These norms correspond, of course, to different metrics on ¢” (when n > 1) 
but one can see very easily that they all induce the same topology on ¢”. 
Actually, more is true. 

If X is a topological vector space over ¢, and dim X = n, then every 
basis of X induces an isomorphism of X onto ¢". Theorem 1.21 will prove 
that this isomorphism must be a homeomorphism. In other words, this says 
that the topology of ¢” is the only vector topology that an n-dimensional 
complex topological vector space can have. 

We shall also see that finite-dimensional subspaces are always closed 
and that no infinite-dimensional topological vector space is locally 
compact. 

Everything in the preceding discussion remains true with real scalars 
in place of complex ones. 


1.20 Lemma _ If X is a complex topological vector space and f: €" > X is 
linear, then f is continuous. 


PROOF. Let {e,,..., €,} be the standard basis of ¢”: The kth coordi- 
nate of e, is 1, the others are 0. Put u, = f(e,), for k = 1, ..., n. Then 
f(z) =2,u, +++: +2,u, for every z = (z,,..., z,) in ¢”. Every z, is a 


continuous function of z. The continuity of fis therefore an immediate 
consequence of the fact that addition and scalar multiplication are 
continuous in X. //// 


1.21 Theorem I[f7n is a positive integer and Y is an n-dimensional sub- 
space of a complex topological vector space X, then 


(a) every isomorphism of €" onto Y is a homeomorphism, and 
(b) Y is closed. 


1.22 
finite 
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PROOF. Let S be the sphere which bounds the open unit ball B of 
¢". Thus zeS if and only if X|z,|? = 1, and ze B if and only if 
zz, |? < 1. 

Suppose f: ¢"-— Y is an isomorphism. This means that f is 
linear, one-to-one, and f(¢") = Y. Put K = f(S). Since f is continuous 
(Lemma 1.20), K is compact. Since f(0) = O and fis one-to-one, 0 € K, 
and therefore there is a balanced neighborhood V of 0 in X which 
does not intersect K. The set 


E=f"'(V)=f-'(V 0 Y) 


is therefore disjoint from S. Since fis linear, E is balanced, and hence 
connected. Thus E c B, because 0 € E, and this implies that the linear 
map f—! takes V - Y into B. Since f~* is an n-tuple of linear func- 
tionals on Y, the implication (d) > (a) in Theorem 1.18 shows that f~' 
is continuous. Thus fis a homeomorphism. 

To prove (b), choose pe Y, and let f and V be as above. For 
some t > 0, p € tV, so that p lies in the closure of 


Y a (tV) cf(tB) < f(tB). 


Being compact, f (tB) is closed in X. Hence p € f(tB) c Y, and this 


proves that Y= Y. //// 
Theorem Every locally compact topological vector space X has 
dimension. 


PROOF. The origin of X has a neighborhood V whose closure is 
compact. By Theorem 1.15, V is bounded, and the sets 2° "V (n = 1, 2, 
3, ...) form a local base for X. 

The compactness of V shows that there exist x,, ..., x, in X 
such that 


Vo(x, +4V)U-:- U(x, + 4V). 
Let Y be the vector space spanned by x,, ..., x,,. Then dim Y < m. 
By Theorem 1.21, Y is a closed subspace of X. 


Since V < Y + 4V and since AY = Y for every scalar A # 0, it 
follows that 


SV co Y+iV 
so that 
Vo VY+4Vio V+ VY+4V=Y+ Hv. 
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If we continue in this way, we see that 


Vc Q (Y +27"V). 


n=1 


Since {2~"V} is a local base, it now follows from (a) of Theorem 1.13 
that V c Y. But Y= Y. Thus V c Y, which implies that kV < Y for 
k = 1,2, 3,.... Hence Y = X, by (a) of Theorem 1.15, and consequent- 
ly dim X < m. //// 


1.23 Theorem [If X is a locally bounded topological vector space with the 
Heine-Borel property, then X has finite dimension. 


PROOF. By assumption, the origin of X has a bounded neighborhood 
V. Statement (f) of Theorem 1.13 shows that Vis also bounded. Thus 
Vis compact, by the Heine-Borel property. This says that X is locally 
compact, hence finite-dimensional, by Theorem 1.22. 


Metrization 


We recall that a topology t on a set X is said to be metrizable if there is a 
metric d on X which is compatible with t. In that case, the balls with radius 
1/n centered at x form a local base at x. This gives a necessary condition 
for metrizability which, for topological vector spaces, turns out to be also 
sufficient. 


1.24 Theorem If X is a topological vector space with a countable local 
base, then there is a metric d on X such that 


(a) dis compatible with the topology of X, 
(b) the open balls centered at 0 are balanced, and 
(c) dis invariant: d(x + z, y + z) = d(x, y) for x, y,z EX. 


If, in addition, X is locally convex, then d can be chosen so as to satisfy 
(a), (b), (c), and also 


(d) all open balls are convex. 


PROOF. By Theorem 1.14, X has a balanced local base {V,} such that 
(1) Vater. t Vari t+ Vari t+ Uri cv, (n+ 1, 2, 3, ...); 


when X is locally convex, this local base can be chosen so that each V, 
is also convex. 


CHAPTER 1: TOPOLOGICAL VECTOR SPACES 19 


Let D be the set of all rational numbers r of the form 


oO 


(2) r= Ye()2-", 
n=1 
where each of the “digits” c,r) is 0 or 1 and only finitely many are 1. 
Thus each r € D satisfies the inequalities 0 < r < 1. 
Put A(r) = X ifr > 1; for any r € D, define 


(3) A(r) = c,(r)V, + c2(r)V2 + c3(r)V3 + °°. 
Note that each of these sums is actually finite. Define 
(4) f(x) = inf {r: x e A()} (x € X) 
and 

(5) d(x, y)=f(x—y) (xe X, ye X). 


The proof that this d has the desired properties depends on the 
inclusions 


(6) A(r) + A(s) < A(r + s) (re D, s € D). 


Before proving (6), let us see how the theorem follows from it. 
Since every A(s) contains 0, (6) imples 


(7) A(r) < A(r) + A(t —r) < A(t) if r<t. 
Thus {A(r)} is totally ordered by set inclusion. We claim that 
(8) (x+y) sfx t+fly) (xe X, ye X). 


In the proof of (8) we may, of course, assume that the right side is <1. 
Fix ¢ > 0. There exist r and s in D such that 


f(x) <r, I(y) <5, r+s<f(x)+f(y) +6. 


Thus x € A(r), y € A(s), and (6) implies x + y € A(r +s). Now (8) 
follows, because 


S(x+y)srt+ts<f(x)+fly) +8, 
and é was arbitrary. 

Since each A(r) is balanced, f(x) = f(—x). It is obvious that 
f(0)=0. If x40, then x¢€V,=A(2~") for some n, and so 
f(x) =>2°">0. 

These properties of f show that (5) defines a translation-invariant 
metric d on X. The open balls centered at 0 are the open sets 


(9) BO) = {x: f(x) < d} = U Ae). 


If 6 <2~", then B,(0) < V,. Hence {B,(0)} is a local base for the topol- 
ogy of X. This proves (a). Since each A(r) is balanced, so is each B,(0). 
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If each V, is convex, so is each A(r), and (9) implies that the same is 
true of each B,(0), hence also of each translate of B,(0). 

We turn to the proof of (6). Ifr+s> 1, then A(r + s) = X and 
(6) is obvious. We may therefore assume that r + s < 1, and we will 
use the following simple proposition about addition in the binary 
system of notation: 

If r, s, and r + s are in D and ¢,{(r) + c,(s) # c,(r + s) for some n, 
then at the smallest n where this happens we have c,(r) = c,(s) = 0, 
c,(r +s) = 1. 

Put «, =c,(r), B, = c,(S), Yn = C,(r + 5). If «, + B, =, for all n 
then (3) shows that A(r) + A(s) = A(r + s). In the other case, let N be 
the smallest integer for which a, + By #yy. Then, as mentioned 
above, ay = By = 0, yy = 1. Hence 


A(r) < a,V, +-°° + Oy_ Vy_ 1 + Vay + Vwgn to": 
SOV, tee + hy Vyi a + Vg + Va 
Likewise 
A(s) < ByV, +°°> + By-1Vy—a + Vai + Vwi. 
Since a, + B, = y, for all n < N, (1) now leads to 
A(r) + A(s) < y1Vy + 07° + ¥n-1V-1 + Vy < Ar t+ 8) 


because yy = 1. [H/ 


1.25 Cauchy sequences (a) Suppose d is a metric on a set X. A 
sequence {x,} in X is a Cauchy sequence if to every ¢ > 0 there corresponds 
an integer N such that d(x,,, x,) < € whenever m>N and n> N. If every 
Cauchy sequence in X converges to a point of X, then d is said to be a 
complete metric on X. 


(b) Let t be the topology of a topological vector space X. The notion 
of Cauchy sequence can be defined in this setting without reference to any 
metric: Fix a local base @ for t. A sequence {x,,} in X is then said to be a 
Cauchy sequence if to every V € @ corresponds an N such that x, — x,,€ V 
ifn > Nandm>N. 

It is clear that different local bases for the same Tt give rise to the same 
class of Cauchy sequences. 

(c) Suppose now that X is a topological vector space whose topology 
t is compatible with an invariant metric d. Let us temporarily use the terms 
d-Cauchy sequence and t-Cauchy sequence for the concepts defined in (a) 
and (b), respectively. Since 


A(x, ’ Xm) = A(x, — Xm > 0), 
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and since the d-balls centered at the origin form a local base for t, we 
conclude: 


A sequence {x,} in X is a d-Cauchy sequence if and only if it is a 
t-Cauchy sequence. 


Consequently, any two invariant metrics on X that are compatible 
with t have the same Cauchy sequences. They clearly also have the same 
convergent sequences (namely, the t-convergent ones). These remarks prove 
the following fact: 


If d, and d, are invariant metrics on a vector space X which induce the 
same topology on X, then 


(a) d, and d, have the same Cauchy sequences, and 
(b) d, is complete if and only if d, is complete. 


Invariance is needed in the hypothesis (Exercise 12). 
The following “dilation principle” will be used several times. 


1.26 Theorem Suppose that (X, d,) and (Y, d,) are metric spaces, and 
(X, d,) is complete. If E is a closed set in X, f: E — Y is continuous, and 


d( f(x’), f(x") = dy’, x”) 
for all x’, x” € E, then f(E) is closed. 


PROOF. Pick y € f(E). There exist points x, € E so that y = lim f(x,). 
Thus { f(x,)} is Cauchy in Y. Our hypothesis implies therefore that 
{x,} is Cauchy in X. Being a closed subset of a complete metric space, 
E is complete; hence there exists x = lim x, in E. Since f is contin- 
uous, 


f(x) = lim f(x,) = y. 
Thus y € f (E). [III 
1.27 Theorem Suppose Y is a subspace of a topological vector space X, 


and Y is an F-space (in the topology inherited from X). Then Y is a closed 
subspace of X. 


PROOF. Choose an invariant metric d on Y, compatible with its topol- 
ogy. Let 


1 
Bain = . ey: dy, 0) < “t 
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let U, be a neighborhood of 0 in X such that Y n U, = B,,,, and 
choose symmetric neighborhoods V, of 0 in X such thatV,+ Vc U, 
and V,,,< V,. 

Suppose x € Y, and define 


E,= Y a(x + V,) (i= My Sah 05)) 


If y, e E, and y, € E,, then y, — y, lies in Y and also in V, + Vic 
U,,, hence in B,,,. The diameters of the sets E, therefore tend to 0. 
Since each E, is nonempty and since Y is complete, it follows that the 
Y-closures of the sets E, have exactly one point y, in common. 

Let W be a neighborhood of 0 in X, and define 


F,=Yon(x+wWwo V,). 


The preceding argument shows that the Y-closures of the sets F,, have 
one common point yy. But F, < E,. Hence yw = yo. Since F, < 
x + W, it follows that y, lies in the X-closure of x + W, for every W. 
This implies yy = x. Thus x € Y. This proves that Y= Y. //// 


The following simple facts are sometimes useful. 


1.28 Theorem 


(a) If dis atranslation-invariant metric on a vector space X then 
d(nx, 0) < nd(x, 0) 


for every x € X and for n = 1, 2, 3,.... 


(b) If {x,} is a sequence in a metrizable topological vector space X and if 
xX, 0 as n— oo, then there are positive scalars y,, such that y, - 0 and 
YnX, > 0. 


PROOF. Statement (a) follows from 
d(nx, 0) < > d(kx, (k — 1)x) = nd(x, 0). 
k=1 


To prove (b), let d be a metric as in (a), compatible with the 
topology of X. Since d(x,, 0) 0, there is an increasing sequence of 
positive integers n, such that d(x,,0)<k~* ifn>n,. Put y, =1 if 
n<n,;puty, =kifn, <n <n,,,. For such na, 


Ay, Xn> 0) = d(kx,, 0) < kd(x,, 0) < k7'. 


Hence y, x, ~ 0 asn— oo. [// 
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Boundedness and Continuity 


1.29 Bounded sets The notion of a bounded subset of a topological 
vector space X was defined in Section 1.6 and has been encountered several 
times since then. When X is metrizable, there is a possibility of misunder- 
standing, since another very familiar notion of boundedness exists in metric 
spaces. 

If d is a metric on a set X, a set E < X is said to be d-bounded if there 
is anumber M < o such that d(z, y) < M for all x and yin E. 

If X is a topological vector space with a compatible metric d, the 
bounded sets and the d-bounded ones need not be the same, even if d is 
invariant. For instance, if d is a metric such as the one constructed in 
Theorem 1.24, then X itself is d-bounded (with M = 1) but, as we shall see 
presently, X cannot be bounded, unless X = {0}. If X is a normed space 
and d is the metric induced by the norm, then the two notions of 
boundedness coincide; but if d is replaced by d, = d/(1 + d) (an invariant 
metric which induces the same topology) they do not. 

Whenever bounded subsets of a topological vector space are dis- 
cussed, it will be understood that the definition is as in Section 1.6: A set E is 
bounded if, for every neighborhood V of 0, we have Ec tV for all suffi- 
ciently large t. 

We already saw (Theorem 1.15) that compact sets are bounded. To see 
another type of example, let us prove that Cauchy sequences are bounded 
(hence convergent sequences are bounded): If {x,} is a Cauchy sequence in X, 
and V and W are balanced neighborhoods of 0 with V + V c W, then 
[part (b) of Section 1.25] there exists N such that x, € xy + V foralln>N. 
Take s > 1 so that xy € sV. Then 


x, E€SV+tVesV+sVcsW (n => N). 


Hence x, € tW for all n > 1, if t is sufficiently large. 

Also, closures of bounded sets are bounded (Theorem 1.13). 

On the other hand, if x #0 and E = {nx: n= 1, 2, 3, ...}, then E is 
not bounded, because there is a neighborhood V of 0 that does not contain 
x; hence nx is not in nV;; it follows that no nV contains E. 

Consequently, no subspace of X (other than {0}) can be bounded. 

The next theorem characterizes boundedness in terms of sequences. 


1.30 Theorem The following two properties of a set E in a topological 
vector space are equivalent: 


(a) Eis bounded. 


(b) If {x,} is a sequence in E and {a,$ is a sequence of scalars such that 
a, —7>O0asn— o, thena,x, ~0asn—- o. 
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PROOF. Suppose E is bounded. Let V be a balanced neighborhood 
of 0 in X. Then Ec tV for some t. If x, € E and a, > 0, there exists 
N such that |a,|t <1 if n> N. Since t-'E c V and V is balanced, 
a, x, € V for alln > N. Thus a,x, > 0. 

Conversely, if E is not bounded, there is a neighborhood V of 0 
and a sequence r, — o0 such that nor, V contains E. Choose x, € E 
such that x, ¢ r,V. Then no r, ‘x, is in V, so that {r,‘x,} does not 
converge to 0. //// 


1.31 Bounded linear transformations Suppose X and Y are topologi- 
cal vector spaces and A: X — Y 1s linear. A is said to be bounded if A maps 
bounded sets into bounded sets, 1.e., if A(E) is a bounded subset of Y for 
every bounded set Ec X. 

This definition conflicts with the usual notion of a bounded function 
as being one whose range is a bounded set. In that sense, no linear function 
(other than 0) could ever be bounded. Thus when bounded linear mappings 
(or transformations) are discussed, it is to be understood that the definition 
is in terms of bounded sets, as above. 


1.32 Theorem Suppose X and Y are topological vector spaces and 
A: X — Y is linear. Among the following four properties of A, the implications 


(a) > (b) > (c) 
hold. If X is metrizable, then also 
(c) > (d) > (a), 


so that all four properties are equivalent. 


(a) A is continuous. 

(b) A is bounded. 

(c) Ifx, 70 then {Ax,:n = 1, 2, 3,...} is bounded. 
(d) Ifx,—0 then Ax, — 0. 


Exercise 13 contains an example in which (b) holds but (a) does not. 


PROOF. Assume (a), let E be a bounded set in X, and let W be a 
neighborhood of 0 in Y. Since A is continuous (and AO = 0) there is a 
neighborhood V of 0 in X such that A(V) c W. Since E is bounded, 
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E c tV for all large t, so that 
A(E) < A(tV) = tA(V) < tW. 


This shows that A(E) is a bounded set in Y. 

Thus (a)—(b). Since convergent sequences are bounded, 
(b) > (c). 

Assume now that X is metrizable, that A satisfies (c), and that 
x, > 0. By Theorem 1.28, there are positive scalars y, — oo such that 
YnX, 70. Hence {A(y,x,)} is a bounded set in Y, and now Theorem 
1.30 implies that 


Ax, =7, ‘A(y,X,) 20 as noo. 


Finally, assume that (a) fails. Then there is a neighborhood W of 
0 in Y such that A~‘(W) contains no neighborhood of 0 in X. If X 
has a countable local base, there is therefore a sequence {x,} in X so 
that x, > 0 but Ax, ¢ W. Thus (d) fails. //// 


Seminorms and Local Convexity 


1.33 Definitions A seminorm on a vector space X is a real-valued func- 
tion p on X such that 


(a) p(x + y) < p(x) + p(y) and 
(b) p(ax) = | «| p(x) 


for all x and yin X and all scalars «. 


Property (a) 1s called subadditivity. Theorem 1.34 will show that a semi- 
norm p is a norm if it satisfies 


(c) p(x) #0 x #0. 


A family 7 of seminorms on _X is said to be separating if to each x # 0 
corresponds at least one p € F with p(x) ¥ 0. 

Next, consider a convex set A < X which is absorbing, in the sense 
that every x €e X lies in tA for some t = t(x) > 0. [For example, (a) of 
Theorem 1.15 implies that every neighborhood of 0 in a topological vector 
space is absorbing. Every absorbing set obviously contains 0.] The 
Minkowski functional 1, of A is defined by 


u(x) = inf {t > 0: t~'x € A} (x € X). 


Note that ,(x) < oo for all x € X, since A is absorbing. The seminorms on 
X will turn out to be precisely the Minkowski functionals of balanced 
convex absorbing sets. 
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Seminorms are closely related to local convexity, in two ways: In 


every locally convex space there exists a separating family of continuous 
seminorms. Conversely, if P is a separating family of seminorms on a vector 
space X, then Y can be used to define a locally convex topology on X with 
the property that every pe is continuous. This is a frequently used 
method of introducing a topology. The details are contained in Theorems 
1.36 and 1.37. 


1.34 Theorem Suppose p is a seminorm ona vector space X. Then 


(a) 
(b) 
(c) 
(d) 
(e) 


n(0) = 0. 

| p(x) — p(y)| < p(x — y). 

p(x) = 0. 

{x: p(x) = 0} is a subspace of X. 

The set B = {x: p(x) < 1} is convex, balanced, absorbing, and p = p,. 


PROOF. Statement (a) follows from p(ax) = |«| p(x), with « = 0. The 
subadditivity of p shows that 


P(x) = p(x — y + y) S P(x — y) + Ply) 
so that p(x) — p(y) < p(x — y). This also holds with x and y inter- 
changed. Since p(x — y) = p(y — x), (b) follows. With y = 0, (b) implies 
(c). If p(x) = p(y) = 0 and a, B are scalars, (c) implies 


0 < plax + By) <|a| p(x) + | Bl p(y) = 9. 
This proves (d). 
As to (e), it is clear that B is balanced. If x € B, ye B, and 
0<t< 1, then 


p(tx + (1 — t)y) < tp(x) + (1 — p(y) < 1. 
Thus B is convex. If x € X and s > p(x) then p(s~'x) = s~ *p(x) < 1. 
This shows that B is absorbing and also that y,(x) < s. Hence pp, < p. 
But if 0 <t < p(x) then p(t~'x) > 1, and so t~'x is not in B. This 
implies p(x) < u,(x) and completes the proof. Hid 


1.35 Theorem Suppose A is a convex absorbing set in a vector space X. 
Then 


(a) pg(x + y) < wal) + wAly). 
(b)  py(tx) = tug(x) ift > 0. 
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(c) pt, is a seminorm if A is balanced. 
(dq) If B= {x: p(x) <1} and C= {x:yu,(x) < 1}, then BC ACC and 
Hp = La = Ue: 


PROOF. If t = u,(x) + e and s = u,(y) + &, for some ¢ > 0, then x/t and 
y/s are in A; hence so is their convex combination 


ey t x sy 


SEE SHR EF 64h 


This shows that y,(x + y)<s+t=uyp,(x) + u,(y) + 26, and (a) is 
proved. 

Property (b) is clear, and (c) follows from (a) and (5). 

When we turn to (d), the inclusions B <c A < C show that uc < 
[4 < Ug. TO prove equality, fix x e« X, and choose s, t so that u(x) < 
s<t. Then x/seC, u,(x/s) < 1, w,(x/t) < s/t < 1; hence x/t € B, so 
that u(x) < t. This holds for every t > u(x). Hence up(x) < U(x). //// 


1.36 Theorem Suppose @ is a convex balanced local base in a topologi- 
cal vector space X. Associate to every V € & its Minkowski functional py . 
Then 


(a) V={xeX: p(x) < 1}, for every V € B, and 
(b) {py: V € B} is a separating family of continuous seminorms on X. 


PROOF. If x Ee V, then x/t e V for some t <1, because V is open; 
hence p,(x) < 1. If x € V, then x/t e V implies t > 1, because V is 
balanced; hence p,(x) > 1. This proves (a). 

Theorem 1.35 shows that each py is a seminorm. If r > 0, it 
follows from (a) and Theorem 1.34 that 


| Uy(x) — My(y)| < wy(x — y) <r 


ifx — yerV. Hence py, is continuous. If x e X and x #0, then x € V 
for some V e€ &. For this V, w,{x) > 1. Thus {u,} is separating. —_//// 


1.37 Theorem Suppose F is a separating family of seminorms on a vector 
space X. Associate to each p € P and to each positive integer n the set 


1 
V(p, n) = ‘x p(x) < “t. 
Let & be the collection of all finite intersections of the sets V(p, n). Then & is 


a convex balanced local base for a topology t on X, which turns X into a 
locally convex space such that 
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(a) every p € P is continuous, and 
(b) aset E < X is bounded if and only if every p € F is bounded on E. 


PROOF. Declare a set A < X to be open if and only if A is a (possibly 
empty) union of translates of members of &. This clearly defines a 
translation-invariant topology t on X; each member of & is convex 
and balanced, and & is a local base for t. 

Suppose x € X, x #0. Then p(x) > 0 for some p € #. Since x is 
not in V(p, n) if np(x) > 1, we see that 0 is not in the neighborhood 
x — V(p, n) of x, so that x is not in the closure of {0}. Thus {0} is a 
closed set, and since t is translation-invariant, every point of X is a 
closed set. 

Next we show that addition and scalar multiplication are con- 
tinuous. Let U be a neighborhood of 0 in X. Then 


(1) Ud V (Pi, ny) nat MS V(Dins Mm) 
for some p,,..., Py € A and some positive integers n,,...,n,,. Put 
(2) V = V(P,, 2n;) a aaa a VD» 2n,): 


Since every p € F# is subadditive, V + V c U. This proves that addi- 
tion is continuous. 

Suppose now that x e€ X, a is a scalar, and U and V are as 
above. Then x € sV for some s > 0. Put t= s/1 + |a|s). Ifypex+tV 
and | B — a| < 1/s, then 


By — ax = Bly — x) + (B — a)x 
which lies in 
[BIoV+ |B-—alsVcoV+VcU 


since |B|t < 1 and V is balanced. This proves that scalar multiplica- 
tion is continuous. 

Thus X is a locally convex space. The definition of V(p, n) shows 
that every p € F is continuous at 0. Hence p is continuous on X, by 
(b) of Theorem 1.34. 

Finally, suppose E < X is bounded. Fix p € #. Since V(p, 1) is a 
neighborhood of 0, E < kV(p, 1) for some k < oo. Hence p(x) < k for 
every x € E. It follows that every p € Ais bounded on E. 

Conversely, suppose E satisfies this condition, U is a neighbor- 
hood of 0, and (1) holds. There are numbers M; < o such that p; < 
M,; on E (1<i<m). If n>M,n, for 1<i<m, it follows that 
E <nU, so that E is bounded. //// 


1.38 Remarks (a) It was necessary to take finite intersections of the sets 
V(p, n) in Theorem 1.37; the sets V(p, n) themselves need not form a local 
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base. (They do form what is usually called a subbase for the constructed 
topology.) To see an example of this, take X = R’, and let P consist of the 
seminorms p, and p, defined by px) =|x;|; here x = (x,, x2). Exercise 8 
develops this comment further. 

(b) Theorems 1.36 and 1.37 raise a natural problem: If @ is a convex 
balanced local base for the topology t of a locally convex space X, then # 
generates a separating family F of continuous seminorms on X, as in 
Theorem 1.36. This Y in turn induces a topology t, on X, by the process 
described in Theorem 1.37. Is t = t,? 

The answer is affirmative. To see this, note that every pe VY is t- 
continuous, so that the sets V(p, n) of Theorem 1.37 are in t. Hence t, < t. 
Conversely, if W € @ and p = py, then 


W = {x: p(x) < 1} = V@, 1). 
Thus W et, for every W € &; this implies that t c 7,. 

(c) If P = {p;: i = 1, 2, 3, ...} is a countable separating family of semi- 
norms on X, Theorem 1.37 shows that FY induces a topology t with a 
countable local base. By Theorem 1.24, t is metrizable. In the present situ- 
ation, a compatible translation-invariant metric can be defined directly in 
terms of {p;} by setting 


ci PAX — Y) 
(1) d(x, y) = max ~~~, 
; 1+ p{x—y) 
where {c;,} is some fixed sequence of positive numbers which converges to 0 
as 1— 0. 
It is easy to verify that d is a metric on X. 
We claim that the balls 


(2) B, = {x: d(0, x) <r} (0<r<o) 


form a convex balanced local base for t. 

Fix r. If c; <r (which holds for all but finitely many i, since c; > 0), 
then c; p;/(1 + p;) <r. Hence B, is the intersection of finitely many sets of 
the form 


(3) * p(x) < — 
c;—Tr 


namely those for which c; > r. These sets are open, since each p, is contin- 
uous (Theorem 1.37). Thus B, is open, and, by Theorem 1.34, is also convex 
and balanced. 

Next, let W be a neighborhood of 0 in X. The definition of t shows 
that W contains the intersection of appropriately chosen sets 


(4) V(p;, 0;) = {x: p{x) < 6; < 1} (1 <i<k). 
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If 2r < min {c,6,,..., ¢,6,} and x € B,, then 
C; p(X) C0; 
—— —— 1<i<hk), 
©) 1 + px) 2 Cate 


which implies p{x) < 6;. Thus B, c W. 
This proves our claim and also shows that dis compatible with t. 


1.39 Theorem A topological vector space X is normable if and only if its 
origin has a convex bounded neighborhood. 


PROOF. If X is normable, and if || - || is a norm that is compatible with 
the topology of X, then the open unit ball {x: ||x|| < 1} is convex and 
bounded. 


For the converse, assume V is a convex bounded neighborhood 
of 0. By Theorem 1.14, V contains a convex balanced neighborhood 
U of 0; of course, U is also bounded. Define 


(1) IIx] = wx) = (xe X) 


where p is the Minkowski functional of U. 

By (c) of Theorem 1.15, the sets rU (r > 0) form a local base for 
the topology of X. If x #0, then x ¢rU for some r>0; hence 
|x|| >r. It now follows from Theorem 1,35 that (1) defines a norm. 
The definition of the Minkowski functional, together with the fact that 
U is open, implies that 


(2) {x: ||x]| <r} =rU 


for every r > 0. The norm topology coincides therefore with the given 


one. //// 


Quotient Spaces 


1.40 Definitions Let N be a subspace of a vector space X. For every 
x € X, let 2(x) be the coset of N that contains x; thus 
mx)=x4+N. 


These cosets are the elements of a vector space X/N, called the quotient 
space of X modulo N, in which addition and scalar multiplication are 
defined by 


(1) mx) + my) = n(x + y), an(x) = (ax). 


[Note that now az(x) = N when « = 0. This differs from the usual notation, 
as introduced in Section 1.4.] Since N is a vector space, the operations (1) 
are well defined. This means that if x(x) = x(x’) (that is, x’ — x € N) and 
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my) = ny’) then 
(2) mx) + ny) = n(x’) + ay), an(x’) = an(x). 


The origin of X/N is 2(0) = N. By (1), z is a linear mapping of X onto 
X/N with N as its null space; z is often called the quotient map of X onto 
X/N. 

Suppose now that t is a vector topology on X and that N is a closed 
subspace of X. Let ty be the collection of all sets Ec X/N for which 
n ‘(E) € t. Then ty turns out to be a topology on X/N, called the quotient 
topology. Some of its properties are listed in the next theorem. Recall that 
an open mapping is one that maps open sets to open sets. 


1.41 Theorem Let N be a closed subspace of a topological vector space 
X. Let t be the topology of X and define ty as above. 


(a) ty is a vector topology on X/N; the quotient map x: X — X/N is linear, 
continuous, and open. 


(b) If Bisa local base for t, then the collection of all sets n(V) with V € B 
is a local base for ty . 


(c) Each of the following properties of X is inherited by X/N: local convex- 
ity, local boundedness, metrizability, normability. 


(d) If X is an F-space, or a Fréchet space, or a Banach space, so is X/N. 


PROOF. Since z~*(A A B) = 2 '(A) A 2x *(B) and 


am () E,)= ) nm *(E,), 
ty is a topology. A set F < X/N is ty-closed if and only if x~ ‘(F) is 
t-closed. In particular, every point of X/N is closed, since 
n ‘(n(x)) =N+x 


and N was assumed to be closed. 
The continuity of 2 follows directly from the definition of ty. 
Next, suppose V é t. Since 


tn ‘(n(V))=N+V 


and N + V €1, it follows that 2(V) € ty. Thus z is an open mapping. 
If now W is a neighborhood of 0 in X/N, there is a neighbor- 
hood V of 0in X such that 
VtVcen '(W). 


Hence 2(V) + 2(V) < W. Since z is open, z(V) is a neighborhood of 0 
in X/N. Addition is therefore continuous in X/N. 
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The continuity of scalar multiplication in X/N is proved in the 
same manner. This establishes (a). 

It is clear that (a) implies (b). With the aid of Theorems 1.32, 
1.24, and 1.39, it is just as easy to see that (b) implies (c). 

Suppose next that d is an invariant metric on X, compatible 
with t. Define p by 


p(x(x), m(y)) = inf {d(x — y, z):z Ee N}. 


This may be interpreted as the distance from x — y to N. We omit the 
verifications that are now needed to show that p is well defined and 
that it is an invariant metric on X/N. Since 


m{x: d(x, 0) < r}) = {u: p(u, 0) <r}, 


it follows from (b) that p is compatible with ty. 
If X is normed, this definition of p specializes to yield what is 
usually called the quotient norm of X/N: 


|x(x)|| = inf {|]x — z]]: z € N}. 


To prove (d) we have to show that p is a complete metric when- 
ever d is complete. 

Suppose {u,} is a Cauchy sequence in X/N, relative to p. There 
is a subsequence {u,.} with p(u,,, U,,,,)< 27". One can then induc- 
tively choose x; € X such that x(x,;) = u,, and d(x;, X;4,) < 27" If d is 
complete, the Cauchy sequence {x;} converges to some x € X. The 
continuity of x implies that U,, > W(x) as i> oo. But if a Cauchy 
sequence has a convergent subsequence then the full sequence must 
converge. Hence p is complete, and so is the proof of Theorem 1.41. 


//// 


Here is an easy application of these concepts: 


Theorem Suppose N and F are subspaces of a topological vector 


space X, N is closed, and F has finite dimension. Then N + F is closed. 


PROOF. Let z be the quotient map of X onto X/N, and give X/N its 
quotient topology. Then z(F) is a finite-dimensional subspace of X/N; 
since X/N is a topological vector space, Theorem 1.21 implies that 
mF) is closed in X/N. Since N + F = 27 ‘(x(F)) and z is continuous, 
we conclude that N + F is closed. (Compare Exercise 20.) Ly 


1.43 Seminorms and quotient spaces Suppose p is a seminorm on a 
vector space X and 


N = {x: p(x) = 0}. 
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Then N is a subspace of X (Theorem 1.34). Let z be the quotient map of X 
onto X/N, and define 


P(x(x)) = p(x). 
If x(x) = x(y), then p(x — y) = 0, and since 


| p(x) — ply)| < p(x — y) 


it follows that p(z(x)) = p(x(y)). Thus p is well defined on X/N, and it is now 
easy to verify that p is a norm on X/N. 

Here is a familiar example of this. Fix r, 1 < r < oo; let I’ be the space 
of all Lebesgue measurable functions on [0, 1] for which 


1 1/r 
AS) =f l,= 1 | FCI" i} < 0. 


This defines a seminorm on I, not a norm, since || f ||, = 0 whenever f = 0 
almost everywhere. Let N be the set of these “null functions.” Then L/N 1s 
the Banach space that is usually called Z. The norm of I is obtained by the 
above passage from p to p. 


Examples 


1.44 The spaces C(Q) If Q is a nonempty open set in some euclidean 
space, then Q is the union of countably many compact sets K, # @ which 
can be chosen so that K, lies in the interior of K,,, (n = 1, 2, 3, ...). C(Q) 1s 
the vector space of all complex-valued continuous functions on Q, topol- 
ogized by the separating family of seminorms 


(1) PAS) = sup {| f(x)|: x € K,}, 


in accordance with Theorem 1.37. Since p, < p, < ---, the sets 


I 
(2) r= {fe C1: pf) <>} (n = 1, 2, 3, ...) 


form a convex local base for C(Q). According to remark (c) of Section 1.38, 
the topology of C(Q) is compatible with the metric 


2°"PAS — 9) 

eo) me L + Pf — 9) 
If { f;} is a Cauchy sequence relative to this metric, then p,(f; — f;) > 0 for 
every n, as i, j > 00, so that { f;} converges uniformly on K,,, to a function 
f € CQ). An easy computation then shows d(f, f;) > 0. Thus d is a complete 
metric. We have now proved that C(Q) is a Fréchet space. 

By (b) of Theorem 1.37, a set E ¢ C(Q) is bounded if and only if there 
are numbers M, < oo such that p,(f) < M, for all fe E; explicitly, 


(4) If()| <M,  iffe EandxeK,. 
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Since every V, contains an f for which p,,,(/) is as large as we please, it 
follows that no V, is bounded. Thus C(Q) is not locally bounded, hence is not 
normable. 


1.45 The spaces H(Q) Let QO now be a nonempty open subset of the 
complex plane, define C(Q) as in Section 1.44, and let H(Q) be the subspace 
of C(Q) that consists of the holomorphic functions in Q. Since sequences of 
holomorphic functions that converge uniformly on compact sets have holo- 
morphic limits, H(Q) is a closed subspace of C(Q). Hence H(Q) is a Fréchet 
space. 

We shall now prove that H(Q) has the Heine-Borel property. It will 
then follow from Theorem 1.23 that H(Q) is not locally bounded, hence is not 
normable. 

Let E be a closed and bounded subset of H(Q). Then E satisfies 
inequalities such as (4) of Section 1.44. Montel’s classical theorem about 
normal families (Th. 14.6 of [23]') implies therefore that every sequence 
{ f;} < E has a subsequence that converges uniformly on compact subsets of 
Q [hence in the topology of H(Q)] to some f € H(Q). Since E is closed, f € E. 
This proves that E is compact. 


1.46 The spaces C?(Q) and Zx We begin this section by introducing 
some terminology that will be used in our later work with distributions. 

In any discussion of functions of n variables, the term multi-index 
denotes an ordered n-tuple 


(1) a =(a,,..., &,) 


of nonnegative integers «;. With each multi-index « is associated the differ- 
ential operator 


6 \" a \ 
™ ae a 


whose order is 
(3) laj=a,+-''+4,. 


If|a| = 0, Df=f. 
A complex function f defined in some nonempty open set Qc R” is 
said to belong to C%(Q) if D?f € C(Q) for every multi-index a. 


’ Numbers in brackets refer to sources listed in the bibliography. 
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The support of a complex function f (on any topological space) is the 
closure of {x: f(x) # 0}. 

If K is a compact set in R", then DB, denotes the space of all 
f € C™”(R") whose support lies in K. (The letter D has been used for these 
spaces ever since Schwartz published his work on distributions.) If K < Q, 
then D, may be identified with a subspace of C”(Q). 

We now define a topology on C®(Q) which makes C™(Q) into a Fréchet 
space with the Heine-Borel property, such that Dy is a closed subspace of 
C”(Q) whenever K <Q. 

To do this, choose compact sets K; (i = 1, 2, 3, ...) such that K, lies in 
the interior of K;,, and Q = |) K;. Define seminorms py on C®(Q), N = 1, 
2, 3,..., by setting 


(4) Pr(f) = max {| D’f(x)|: x € Ky, |a| < N}. 


They define a metrizable locally convex topology on C®(Q); see Theorem 
1.37 and remark (c) of Section 1.38. For each x € Q, the functional f— f(x) 
is continuous in this topology. Since D, is the intersection of the null spaces 
of these functionals, as x ranges over the complement of K, it follows that 
Q, is closed in C®(Q). 

A local base is given by the sets 


1 
(5) y= |fe C™O): rl < Zt (N = 1, 2, 3,...). 


If {f;} is a Cauchy sequence in C%(Q) (see Section 1.25) and if N is fixed, 
then f; — f; € Vy if i and j are sufficiently large. Thus | D*f, — D*f;| < 1/N on 
Ky, if |a| < N. It follows that each D*f; converges (uniformly on compact 
subsets of Q) to a function g,. In particular, f(x) > go(x). It is now evident 
that gp €e C~(Q), that g, = D%g,, and that f; > g in the topology of C”(Q). 

Thus C%(Q) is a Fréchet space. The same is true of each of its closed 
subspaces Dy. 

Suppose next that E <c C“%(Q) 1s closed and bounded. By Theorem 
1.37, the boundedness of E is equivalent to the existence of numbers 
My < © such that p,(f) < My for N = 1, 2, 3, ... and for all fe E. The 
inequalities | D*f |< My, valid on Ky when |a| <N, imply the equicon- 
tinuity of {D*f: fe E} on Ky_,, if |B|<N—1. It now follows from 
Ascoli’s theorem (proved in Appendix A) and Cantor’s diagonal process 
that every sequence in E contains a subsequence { f;} for which {D*f;} con- 
verges, uniformly on compact subsets of Q, for each multi-index fB. Hence 
{ f;} converges in the topology of C°(Q). This proves that E is compact. 

Hence C~(Q) has the Heine-Borel property. It follows from Theorem 
1.23 that C°(Q) is not locally bounded, hence not normable. The same con- 
clusion holds for ZY, whenever K has nonempty interior (otherwise Dy = 
{0}), because dim Y, = oo in that case. This last statement is a consequence 
of the following proposition: 
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If B, and B, are concentric closed balls in R", with B, in the interior of 
B,, then there exists @ € C™(R") such that $(x)=1 for every xe B,, 
g(x) = 0 for every x outside B,, and0 < @ < 1 on R". 


To find such a ¢, we construct g € C%(R‘) such that g(x) =0 for 
x <a, g(x) = 1 for x > b (where 0 < a <b < o are preassigned) and put 


(6) P(X 1, ---5 X,) = 1 — G(x? +°°+ + x?). 


The following construction of g has the advantage that suitable choices of 
{5; can lead to functions with other desired properties. 

Suppose 0 <a<b< oo. Choose positive numbers 6), 6;, 62, ..., 
with &6; = b — a; put 


7 = a) Cy Ae pee 
(7 m=z = 12,3, 


n 


let fp be a continuous monotonic function such that fo(x) = 0 when x < a, 
fo(x) = 1 when x > a+ 6,; and define 


1 x 
(8) fl) => [4-10 dt (n=1,2,3,..,). 


Differentiation of this integral shows, by induction, that f, has n continuous 
derivatives and that | D"f,| < m,.Ifn>_r, then 


1 (% 
9) Dix) => | (Df, Nx — 1) dt, 
so that 
(10) |Df,l|<m, (n2n), 


again by induction on n. The mean value theorem, applied to (9), shows 
that 
(11) [Df — Dfr-1|Sm,416, (ner 2). 


Since 26, < oo, each {D’f,} converges, uniformly on (— 0, 00), as n— oo. 
Hence { f,} converges to a function g, with |D’g| < m, for r = 1, 2, 3,..., 
such that g(x) = 0 for x < aand g(x) = 1 for x > b. 


1.47 The spaces L? with 0 <p<1 Consider a fixed p in this range. 
The elements of I? are those Lebesgue measurable functions f on [0, 1] for 
which 


(1) nin= | | f(Q|? dt < c, 
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with the usual identification of functions that coincide almost everywhere. 
Since 0 < p < 1, the inequality 


(2) (a+b)? <a’ +b? 
holds when a > O and b > O. This gives 

(3) A(f+ g) < A(f) + Ag), 
so that 

(4) d(f, 9) = A(f — 9) 


defines an invariant metric on I?. That this d 1s complete is proved in the 
Same way as in the familiar case p > 1. The balls 
(5) B,= {fe BP: A(f) <r} 
form a local base for the topology of I?. Since B, = r~'/?B,, for all r > 0, 
B, 1s bounded. 

Thus L? is a locally bounded F-space. 

We claim that I? contains no convex open sets, other than @ and L’. 

To prove this, suppose V # @ is open and convex in I?. Assume 
0 € V, without loss of generality. Then V > B,, for some r > 0. Pick fe L’. 
Since p< 1, there is a positive integer n such that n?~' A(f) <r. By the 
continuity of the indefinite integral of | f |?, there are points 


O=X9 <x, < °° <x, = 1 
such that 


(6) | | f() |? dt =n-* ACL) (1 <i<n). 
Xi-1 

Define gt) = nf(t) 1f x;_, <t < x,, gt) = 0 otherwise. Then g; € V, since 

(6) shows 


(7) A(g;) = n?~' Af) <r (l<i<n) 


and V > B.. Since V 1s convex and 


1 
(8) f= 71+" + Go): 


it follows that fe V. Hence V = L’. 

This lack of convex open sets has a curious consequence. 

Suppose A: I? > Y is a continuous linear mapping of I? into some 
locally convex space Y. Let @ be a convex local base for Y. If W € &, then 
A~'(W) is convex, open, not empty. Hence A7'(W) = I?. Consequently, 
A(L?) < W for every W € &. We conclude that Af = 0 for every f € L. 

Thus 0 is the only continuous linear mapping of I? into any locally 
convex space Y, if 0 < p <1. In particular, 0 is the only continuous linear 
functional on these I?-spaces. 

This is, of course, in violent contrast to the familiar case p > 1. 
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Exercises 


1. Suppose X is a vector space. All sets mentioned below are understood to be 
subsets of X. Prove the following statements from the axioms as given in 
Section 1.4. (Some of these are tacitly used in the text.) 

(a) Ifx e X and ye X there is a unique z € X suchthatx+z=y. 

(b) Ox = 0 = aOif x € X and wis a scalar. 

(c) 2A < A+ A;it may happen that 2A # A+ A. 

(d) Ais convex if and only if (s + t)A = sA + tA for all positive scalars s and t. 

(e) Every union (and intersection) of balanced sets is balanced. 

(f) Every intersection of convex sets is convex. 

(g) If T is a collection of convex sets that is totally ordered by set inclusion, 
then the union of all members of I" is convex. 

(h) If A and B are convex, so is A + B. 

(i) If A and B are balanced, so is A + B. 

(j) Show that parts (/), (g), and (h) hold with subspaces in place of convex sets. 

2. The convex hull of a set A in a vector space X is the set of all convex com- 
binations of members of A, that is, the set of all sums 


1X4 + yee ae oe, 


in which x; € A, t; > 0, ¥) t; = 1; n is arbitrary. Prove that the convex hull of A 
is convex and that it is the intersection of all convex sets that contain A. 

3. Let X be a topological vector space. All sets mentioned below are understood to 

be the subsets of X. Prove the following statements. 

(a) The convex hull of every open set is open. 

(b) If X is locally convex then the convex hull of every bounded set is bounded. 
(This is false without local convexity; see Section 1.47.) 

(c) If A and B are bounded, so is A + B. 

(dq) If A and B are compact, so is A + B. 

(e) If A is compact and B is closed, then A + B is closed. 

(f) The sum of two closed sets may fail to be closed. [The inclusion in (b) of 
Theorem 1.13 may therefore be strict. ] 

4. Let B= {(z,, z,) € ¢7: |z,| <|z,.|}. Show that B is balanced but that its inte- 
rior is not. [Compare with (e) of Theorem 1.13.] 

5. Consider the definition of “bounded set” given in Section 1.6. Would the 
content of this definition be altered if it were required merely that to every 
neighborhood V of 0 corresponds some t > 0 such that Ec tV? 

6. Prove that a set E in a topological vector space is bounded if and only if every 
countable subset of E is bounded. 

7. Let X be the vector space of all complex functions on the unit interval [0, 1], 
topologized by the family of seminorms 


PAPD=1f)| Osx <I). 


This topology is called the topology of pointwise convergence. Justify this ter- 
minology. 

Show that there is a sequence { f,} in X such that (a) { f,} converges to 0 
as n— oo, but (b) if {y,} is any sequence of scalars such that y, > oo then {y, f,} 
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does not converge to 0. (Use the fact that the collection of all complex sequences 
converging to 0 has the same cardinality as [0, 1].) 
This shows that metrizability cannot be omitted in (b) of Theorem 1.28. 


. (a) Suppose F is a separating family of seminorms on a vector space X. Let 2 


be the smallest family of seminorms on X that contains Y and is closed 
under max. [This means: If p, € 2, p,¢ 2, and p = max (p,, p,), then 
p € 2.] If the construction of Theorem 1.37 is applied to A and to 2, show 
that the two resulting topologies coincide. The main difference is that 2 
leads directly to a base, rather than to a subbase. [See Remark (a) of Section 
1.38. ] 

(b) Suppose 2 is as in part (a) and A is a linear functional on X. Show that A is 
continuous if and only if there exists a p € 2 such that | Ax| < Mp(x) for all 
x € X and some constant M < oo. 


9. Suppose 
(a) X and Y are topological vector spaces, 
(b) A: X — ¥Y is linear, 
(c) N is aclosed subspace of X, 
(d) x: X + X/N is the quotient map, and 
(e) Ax = 0 for every x € N. 
Prove that there is a unique f: X/N > Y which satisfies A =f o 7, that is, 
Ax = f (x(x)) for all x € X. Prove that this fis linear and that A is continuous if 
and only if fis continuous. Also, A is open if and only if fis open. 

10. Suppose X and Y are topological vector spaces, dim Y < «0, A: X—Y is 

linear, and A(X) = Y. 
(a) Prove that A is an open mapping. 
(b) Assume, in addition, that the null space of A is closed, and prove that A is 
then continuous. 

11. If N is a subspace of a vector space X, the codimension of N in X is, by defini- 
tion, the dimension of the quotient space X/N. 

Suppose 0 < p < 1 and prove that every subspace of finite codimension is 
dense in L?. (See Section 1.47.) 

12. Suppose d,(x, y) =|x — y], d(x, y) =| (x) — O(y)|, where (x) = x/(1 + | x]). 
Prove that d, and d, are metrics on R which induce the same topology, 
although d, 1s complete and d, is not. 

13. Let C be the vector space of all complex continuous functions on [0, 1]. Define 


_ ft _1 fC) = a(x) 
ano | 1+1f@)— a1” 


Let (C, a) be C with the topology induced by this metric. Let (C, t) be the 
topological vector space defined by the seminorms 


PPJ=IfO)| Osx), 


in accordance with Theorem 1.37. 

(a) Prove that every t-bounded set in C is also o-bounded and that the identity 
map id: (C, t) > (C, a) therefore carries bounded sets into bounded sets. 

(b) Prove that id: (C, t) >(C, c) is nevertheless not continuous, although it is 
sequentially continuous (by Lebesgue’s dominated convergence theorem). 
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14. 


15. 
16. 


17. 


18. 


19. 


20. 
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Hence (C, t) is not metrizable. (See Appendix A6, or Theorem 1.32.) Show 
also directly that (C, t) has no countable local base. 
(c) Prove that every continuous linear functional on (C, t) is of the form 


io de f(x;) 
for some choice of x,,..., x, in [0, 1] and some c¢; € @. 

(d) Prove that (C, oc) contains no convex open sets other than @ and C. 

(e) Prove that id: (C, a) > (C, t) is not continuous. 

Put K = [0, 1] and define 2, as in Section 1.46. Show that the following three 

families of seminorms (where n = 0, 1, 2, ...) define the same topology on J,, if 

D = d/dx: 

(a) ||D"f ||. = sup {|D"f(x)|: —c0 <x < ow}. 


(b) Df ll, at | D'f (x) | dx. 


1/2 


(c) ||D"f ll. = 1 | D"f (x) |? ax| 


0 

Prove that the spaces C(Q) (Section 1.44) do not have the Heine-Borel property. 
Prove that the topology of CQ) does not depend on the particular choice of 
{K,}, as long as this sequence satisfies the conditions specified in Section 1.44. 
Do the same for C®(Q) (Section 1.46). 
In the setting of Section 1.46, prove that f— D*f is a continuous mapping of 
C%(Q) into C*®(Q) and also of DZ, into D,, for every multi-index a. 
Prove the proposition concerning addition in the binary system which was used 
at the end of the proof of Theorem 1.24. 
Suppose M is a dense subspace of a topological vector space X, Y is an F-space, 
and A: M — ¥ is continuous (relative to the topology that M inherits from X) 
and linear. Prove that A has a continuous linear extension A: X > Y. 

Suggestion: Let V, be balanced neighborhoods of 0 in X such that 
V.t+V,cV,_, and such that d(0, Ax) <2°-"ifxeMovV,. If x eX and x, e€ 
(x + V,) \ M, show that {Ax,} is a Cauchy sequence in Y, and define Ax to be 
its limit. Show that A is well defined, that Ax = Ax if x € M, and that A is linear 
and continuous. 
For each real number t and each integer n, define e,(t) = e'", and define 


fr=e_,+ne, (n=1, 2, 3,..,). 


Regard these functions as members of I?7(—7, 2). Let X, be the smallest closed 
subspace of I? that contains ey, e,, €,, ..., and let X, be the smallest closed 
subspace of I? that contains f,, f,, f;, .... Show that X, + X, is dense in I? but 
not closed. For instance, the vector 


is in I? but not in X, + X,.(Compare with Theorem 1.42.) 


21. 


22. 


23. 


24. 
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Let V be a neighborhood of 0 in a topological vector space X. Prove that there 
is a real continuous function f on X such that f(0) = 0 and f(x) = 1 outside V. 
(Thus X is a completely regular topological space.) Suggestion: Let V, be bal- 
anced neighborhoods of 0 such that V; + V, c V and V.,, 4+ V4, < V,. Con- 
struct fas in the proof of Theorem 1.24. Show that fis continuous at 0 and that 


| f(x) —f(Y) | Sf (x — y). 


If fis a complex function defined on the compact interval J = [0, 1] < R, define 


of) = sup {| f(x) -—f(yl: |x — yl <6,x eI, ye I}. 


If 0 < « < 1, the corresponding Lipschitz space Lip « consists of all f for which 


IF ll =| FO + sup {5° %a,(f): 6 > 0} 


is finite. Define 
lip a = {fe Lip «: lim 6" *w,(f) = 0}. 
6-0 


Prove that Lip « is a Banach space and that lip « is a closed subspace of Lip a. 
Let X be the vector space of all continuous functions on the open segment (0, 1). 
For fe X and r>0, let V(f, r) consist of all g € X such that | g(x) —ffx)| <r 
for all x € (0, 1). Let t be the topology on X that these sets V(f, r) generate. 
Show that addition is t-continuous but scalar multiplication is not. 

Show that the set W that occurs in the proof of Theorem 1.14 need not be 
convex, and that A need not be balanced unless U is convex. 


CHAPTER 


COMPLETENESS 


The validity of many important theorems of analysis depends on the com- 
pleteness of the systems with which they deal. This accounts for the inade- 
quacy of the rational number system and of the Riemann integral (to 
mention just the two best-known examples) and for the success encountered 
by their replacements, the real numbers and the Lebesgue integral. Baire’s 
theorem about complete metric spaces (often called the category theorem) is 
the basic tool in this area. In order to emphasize the role played by the 
concept of category, some theorems of this chapter (for instance, Theorems 
2.7 and 2.11) are stated in a little more generality than is usually needed. 
When this is done, simpler versions (more easily remembered but sufficient 
for most applications) are also given. 


Baire Category 


2.1 Definition Let S be a topological space. A set E < S 1s said to be 
nowhere dense if its closure E has an empty interior. The sets of the first 
category in S are those that are countable unions of nowhere dense sets. Any 
subset of S that is not of the first category is said to be of the second 
category in S. 

This terminology (due to Baire) 1s admittedly rather bland and unsug- 
gestive. Meager and nonmeager have been used instead in some texts. But 
“category arguments” are so entrenched in the mathematical literature and 
are so well known that it seems pointless to insist on a change. 

Here are some obvious properties of category that will be freely used 
in the sequel: 


42 
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(a) If A < Band Bis of the first category in S, so is A. 

(b) Any countable union of sets of the first category is of the first category. 

(c) Any closed set E < S whose interior is empty is of the first category 
in S. 

(dq) If his a homeomorphism of S onto S and if E < S, then E and h(E) 
have the same category in S. 


2.2 Baire’stheorem [If S is either 


(a) acomplete metric space, or 
(b) alocally compact Hausdorff space, 


then the intersection of every countable collection of dense open subsets of S is 
dense in S. 


This is often called the category theorem, for the following reason. 

If {E;} is a countable collection of nowhere dense subsets of S, and if 
V, is the complement of E;, then each V, is dense, and the conclusion of 
Baire’s theorem is that (| V; # @. Hence S # |) E;. 

Therefore, complete metric spaces, as well as locally compact Haus- 
dorff spaces, are of the second category in themselves. 


PROOF. Suppose V,, V,, V;,... are dense open subsets of S. Let By be 
an arbitrary nonempty open set in S. Ifn > 1 and an open B,_, # @ 
has been chosen, then (because V, is dense) there exists an open 
B, # © with 


B,, - Vi, ‘-) Biot 


In case (a), B, may be taken to be a ball of radius <1/n; in case (b) the 
choice can be made so that B, is compact. Put 


In case (a), the centers of the nested balls B, form a Cauchy sequence 
which converges to some point of K, and so K # @. In case (b), 
K # @ by compactness. Our construction shows that K c By, and 
K c V, for each n. Hence Bg intersects () V,. //// 


The Banach-Steinhaus Theorem 


2.3 Equicontinuity Suppose X and Y are topological vector spaces and 
I is a collection of linear mappings from X into Y. We say that I is 
equicontinuous if to every neighborhood W of 0 in Y there corresponds a 
neighborhood V of 0in X such that A(V) c W for allA eT. 
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If [ contains only one A, equicontinuity is, of course, the same as 
continuity (Theorem 1.17). We already saw (Theorem 1.32) that continuous 
linear mappings are bounded. Equicontinuous collections have this 
boundedness property in a uniform manner (Theorem 2.4). It is for this 
reason that the Banach-Steinhaus theorem (2.5) is often referred to as the 
uniform boundedness principle. 


2.4 Theorem Suppose X and Y are topological vector spaces, TY is an 
equicontinuous collection of linear mappings from X into Y, and E is a 
bounded subset of X. Then Y has a bounded subset F such that A(E) c F for 
every AeT. 


PROOF. Let F be the union of the sets A(E), for Ae IT. Let W be a 
neighborhood of 0 in Y. Since I is equicontinuous, there is a neigh- 
borhood V of 0 in X such that A(V) < W for all A eT. Since E is 
bounded, E c tV for all sufficiently large t. For these t, 


A(E) < A(tV) = tA(V) c tW, 
so that F < tW. Hence F is bounded. /11/ 


2.5 Theorem (Banach-Steinhaus) Suppose X and Y are topological 
vector spaces, I is a collection of continuous linear mappings from X into Y, 
and B is the set of all x € X whose orbits 


T(x) = {Ax: AeT} 


are bounded in Y. 
If B is of the second category in X, then B= X and IT is equi- 
continuous. 


PROOF. Pick balanced neighborhoods W and U of 0 in Y such that 
U+UcW. Put 
E= () A~*(0). 


Aer 


If x € B, then I(x) < nU for some n, so that x € nE. Consequently, 


00 
Bc () nE. 
n=1 
At least one nE is of the second category in X, since this is true of B. 
Since x > nx is a homeomorphism of X onto X, E is itself of the 
second category in X. But E is closed because each A is continuous. 
Therefore E has an interior point x. Then x — E contains a neighbor- 


CHAPTER 2: COMPLETENESS 45 


hood V of 0 1n_X, and 
A(V) < Ax — A(E) cU-—~UcW 


forevery AeT. 
This proves that I is equicontinuous. By Theorem 2.4, TF is uni- 
formly bounded; in particular, each I(x) is bounded in Y. Hence 


B=X. //// 


In many applications, the hypothesis that B is of the second category 
is a consequence of Baire’s theorem. For example, F-spaces are of the 
second category. This gives the following corollary of the Banach-Steinhaus 
theorem: 


2.6 Theorem If T is a collection of continuous linear mappings from an 
F-space X into a topological vector space Y, and if the sets 


T(x) = {Ax: A eT} 


are bounded in Y, for every x € X, then T is equicontinuous. 


Briefly, pointwise boundedness implies uniform boundedness 
(Theorem 2.4). 

As a special case of Theorem 2.6, let X and Y be Banach spaces, and 
suppose that 


(1) sup ||Ax|| < c for every x € X. 
AeT 


The conclusion is that there exists M < oo such that 


(2) | Ax|]| < M if |x| <landAeT. 
Hence 
(3) | Ax|] < M||x|| ifxe X andAeT. 


The following theorem establishes the continuity of limits of sequences 
of continuous linear mappings: 


2.7 Theorem Suppose X and Y are topological vector spaces, and {A,} is 
a sequence of continuous linear mappings of X into Y. 


(a) If C is the set of all x € X for which {A, x} is a Cauchy sequence in Y, 
and if C is of the second category in X, thenC = X. 


(b) If Lis the set of all x € X at which 


Ax = limA, x 


no 
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exists, if L is of the second category in X, and if Y is an F-space, then 
L = X and A: X > Y is continuous. 


PROOF. (a) Since Cauchy sequences are bounded (Section 1.29), the 
Banach-Steinhaus theorem asserts that {A,} is equicontinuous. 

One checks easily that C is a subspace of X. Hence C is dense. 
(Otherwise, C is a proper subspace of X; proper subspaces have 
empty interior; thus C would be of the first category.) 

Fix x e X; let W be a neighborhood of 0 in Y. Since {A,} is 
equicontinuous, there is a neighborhood V of 0 in X such that 
A,V)<W for n=1, 2, 3, .... Since C is dense, there exists 
x €Cno(x+4+ V). Ifn and mare so large that 


A, x — A, x’ € W, 
the identity 
(A, — A,,)x = A,(x — x’) + (A, — A,)x’ + A," — xX) 


shows that A,x —A,,xeW+W+W. Consequently, {A,x} is a 
Cauchy sequence in Y, and x € C. 

(b) The completeness of Y implies that L = C. Hence L = X, by 
(a). If V and W are as above, the inclusion A,(V) < W, valid for all n, 
implies now that A(V) < W. Thus A is continuous. //// 


The hypotheses of (b) of Theorem 2.7 can be modified in various ways. 
Here is an easily remembered version: 


2.8 Theorem If {A,} is a sequence of continuous linear mappings from an 
F-space X into a topological vector space Y, and if 


Ax = limA, x 


no 


exists for every x € X, then A is continuous. 


PROOF. Theorem 2.6 implies that {A,,} is equicontinuous. Therefore if 
W is a neighborhood of 0 in Y, we have A,(V) < W for all n and for 
some neighborhood V of 0 in X. It follows that A(V) < W; hence 

(being obviously linear) A is continuous. //// 


In the following variant of the Banach-Steinhaus theorem the cate- 
gory argument is applied to a compact set, rather than to a complete metric 
one. Convexity also enters here in an essential way (Exercise 8). 


2.9 Theorem Suppose X and Y are topological vector spaces, K is a 
compact convex set in X, I is a collection of continuous linear mappings of X 
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into Y, and the orbits 
T(x) = {Ax:AeT} 


are bounded subsets of Y, for every x € K. 
Then there is a bounded set B — Y such that A(K) < B for every A ET. 


PROOF. Let B be the union of all sets I(x), for x € K. Pick balanced 
neighborhoods W and U of 0 in Y such that U + U c W. Put 


(1) E= (\A~\(0). 


Aer 
If x e K, then I(x) c nU for some n, so that x € nE. Consequently, 
(2) K = |) (K 1 nE). 

n=1 


Since E is closed, Baire’s theorem shows that K - nE has nonempty 
interior (relative to K) for at least one n. 

We fix such an n, we fix an interior point x, of K 7 nE, we fix a 
balanced neighborhood V of 0 in X such that 


(3) K 1 (Xo + V) Cne_, 
and we fix a p > 1 such that 
(4) K <Xx_ + pV. 


Such a p exists since K is compact. 
If now x is any point of K and 


(5) z=(1—p *)xo +p “x, 
then z € K, since K is convex. Also, 
(6) Z—Xp =p (x—X)EV, 


by (4). Hence z € nE, by (3). Since A(nE) c nU for every Ae IT and 
since xX = pz — (p — 1)x,o, we have 


Ax € pnU —(p — 1)nU c pn(U + U) c pnw. 
Thus B c pnW, which proves that B is bounded. //// 


The Open Mapping Theorem 


2.10 Open mappings Suppose f maps S into T, where S and T are 
topological spaces. We say that fis open at a point p € S if f(V) contains a 
neighborhood of f(p) whenever V is a neighborhood of p. We say that f is 
open if f(U) is open in T whenever U is open in S. 
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It is clear that f is open if and only if f is open at every point of S. 
Because of the invariance of vector topologies, it follows that a linear 
mapping of one topological vector space into another is open if and only if 
it is open at the origin. 

Let us also note that a one-to-one continuous mapping f of S onto T 
is a homeomorphism precisely when fis open. 


2.11 The open mapping theorem Suppose 


(a) X is an F-space, 

(b) Y is a topological vector space, 

(c) A: X — Y is continuous and linear, and 
(d) A(X) is of the second category in Y. 


Then 


(Zi) A(X) = Y, 
(ii) A is an open mapping, and 
(iii) Y is an F-space. 
PROOF. Note that (ii) implies (i), since Y is the only open subspace of 
Y. To prove (ii), let V be a neighborhood of 0 in X. We have to show 
that A(V) contains a neighborhood of 0 in Y. 


Let d be an invariant metric on X that is compatible with the 
topology of X. Define 


(1) V, = (x: d(x, 0) < 2° "r} (n = 0, 1, 2, ...), 


where r > 0 is so small that Vj < V. We will prove that some neigh- 
borhood W of 0 in Y satisfies 


(2) W < A(V,) < ACV). 
Since V; > V, — V,, statement (b) of Theorem 1.13 implies 
(3) A(V,) > A(V,) — A(V2) > A(V2) — A(V)). 


The first part of (2) will therefore be proved if we can show that A(V3) 
has nonempty interior. But 


(4 A(X) = U KA) 


because V, is a neighborhood of 0. At least one kA(V,) is therefore of 
the second category in Y. Since y—ky is a homeomorphism of Y 
onto Y, A(V,) is of the second category in Y. Its closure therefore has 
nonempty interior. 
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To prove the second inclusion in (2), fix y, € A(V,). Assume 
n> 1 and y, has been chosen in A(V,). What was just proved for V, 
holds equally well for V,,,, so that A(V,,,) contains a neighborhood 


of 0. Hence 

(5) (Yn — AVa41)) OMY) # S. 
This says that there exists x, € V, such that 

(6) AXq € Yn — AV + 1): 


Put y,4, = y, — Ax,. Then y,4, € AV, 1) +1) and the construction pro- 
ceeds. 

Since d(x,, 0) < 2° "r, for n = 1, 2, 3, ..., the sums x, +:-- +x, 
form a Cauchy sequence which converges (by the completeness of X) 
to some x € X, with d(x, 0) < r. Hence x € V. Since 


(7) Ax, = > Ove) Sr = Js 
=1 n=1 


and since y,,4, 70 as m— o@ (by the continuity of A), we conclude 
that y, = Ax € A(V). This gives the second part of (2), and (ii) is 
proved. 

Theorem 1.41 shows that X/N is an F-space, if N is the null 
space of A. Hence (iii) will follow as soon as we exhibit an iso- 
morphism f of X/N onto Y which is also a homeomorphism. This can 
be done by defining 


(8) f(x + N) = Ax (x € X). 


It is trivial that this fis an isomorphism and that Ax = f(z(x)), where 
m is the quotient map described in Section 1.40. If V is open in Y, then 


(9) f-'(V) = n(A“*(V)) 


is open, since A is continuous and z is open. Hence f is continuous. If 
E is open in X/N, then 


(10) S(E) = An “(E)) 
is Open, since z is continuous and A is open. Consequently, f is a 
homeomorphism. //// 


2.12 Corollaries 


(a) If A is a continuous linear mapping of an F-space X onto an F-space Y, 
then A is open. 


(b) If A satisfies (a) and is one-to-one, then A~': Y > X is continuous. 
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(c) If X and Y are Banach spaces, and if A: X > Y is continuous, linear, 
one-to-one, and onto, then there exist positive real numbers a and b such 
that 


al|x|| < \|Ax|| < b]x| 


for everyxé X. 


(d) Ift, <T, are vector topologies on a vector space X and if both (X, T,) 
and (X, T,) are F-spaces, then t, =T). 


PROOF. Statement (a) follows from Theorem 2.11 and Baire’s theorem, 
since Y is now of the second category in itself. Statement (b) is an 
immediate consequence of (a), and (c) follows from (b). The two 
inequalities in (c) simply express the continuity of A~' and of A. 
Statement (d) is obtained by applying (b) to the identity mapping of 
(X, t2) onto (X, 7,). [/// 


The Closed Graph Theorem 


2.13 Graphs _ If X and Y are sets and f maps X into Y, the graph of f is 
the set of all points (x, f(x)) in the cartesian product X x Y. If X and Y are 
topological spaces, if X x Y is given the usual product topology (the small- 
est topology that contains all sets U x V with U and V open in X and Y, 
respectively), and if f: X — Y is continuous, one would expect the graph of f 
to be closed in X x Y (Proposition 2.14). For linear mappings between 
F-spaces this trivial necessary condition is also sufficient to assure conti- 
nuity. This important fact is proved in Theorem 2.15. 


2.14 Proposition If X is a topological space, Y is a Hausdorff space, and 
f: X > Y is continuous, then the graph G of f is closed. 


PROOF. Let 2 be the complement of G in X x Y; fix (Xo, yo) € 2. 
Then yo #f(Xo). Thus yo and f(x,) have disjoint neighborhoods V 
and W in Y. Since fis continuous, x, has a neighborhood U such that 
f(U) < W. The neighborhood U x V of (x9, yo) lies therefore in Q. 
This proves that Q is open. //// 


Note: One cannot omit the hypothesis that Y is a Hausdorff space. 
To see this, consider an arbitrary topological space X, and let f: X — X be 
the identity. Its graph is the diagonal 


D = {(x,x):xeX}oXx x X. 


The statement “D is closed in X x X” is just a rewording of the Hausdorff 
separation axiom. 
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2.15 The closed graph theorem Suppose 


(a) X and Y are F-spaces, 
(b) A: X > Y is linear, 
(c) G= {(x, Ax): x € X} is closed in X x Y. 


Then A is continuous. 


PROOF. X x Y is a vector space if addition and scalar multiplication 
are defined componentwise: 


o(X1, ¥1) + B(X2, V2) = (ax, + Bxz, ay, + By2). 


There are complete invariant metrics dy and dy on X and Y, respec- 
tively, which induce their topologies. If 


A((X1, V1), (X25 Y2)) = Ax(X1, X2) + dy(V1, Yo), 


then d is an invariant metric on X x Y which is compatible with its 

product topology and which makes X x Y into an F-space. (The easy 

but tedious verifications that are needed here are left as an exercise.) 
Since A is linear, G is a subspace of X x Y. Closed subsets of 

complete metric spaces are complete. Therefore G is an F-space. 
Define z,: G—> X andz,: X x YY by 


™1(x, Ax) = x, ™>(x, y) = y. 


Now 7, 1s a continuous linear one-to-one mapping of the F-space G 

onto the F-space X. It follows from the open mapping theorem that 
m,':X -G 

is continuous. But A = 2, °72;,' and z, is continuous. Hence A is 

continuous. HTT 


Remark. The crucial hypothesis (c), that G is closed, is often verified 
in applications by showing that A satisfies property (c’) below: 


(c') If {x,} is a sequence in X such that the limits 


x = limx, and y= limAx, 


exist, then y = Ax. 
Let us prove that (c’) implies (c). Pick a limit point (x, y) of G. 
Since X x Y is metrizable, 


(x, y) = lim (x,, Ax,) 


noo 
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for some sequence {x,}. It follows from the definition of the product 
topology that x, x and Ax,— y. Hence y = Ax, by (c’), and so 
(x, y) € G, and G is closed. 

It is just as easy to prove that (c) implies (c’). 


Bilinear Mappings 


2.16 Definitions Suppose X, Y, Z are vector spaces and B maps 
X x Y into Z. Associate to each x € X and to each y € Y the mappings 


B,: Y>Z and B:X>Z 
by defining 
B,(y) = B(x, y) = B(x). 


B is said to be bilinear if every B, and every B” are linear. 

If X, Y, Z are topological vector spaces and if every B, and every B?” is 
continuous, then B is said to be separately continuous. If B is continuous 
(relative to the product topology of X x Y) then B is obviously separately 
continuous. In certain situations, the converse can be proved with the aid of 
the Banach-Steinhaus theorem. 


2.17 Theorem Suppose B: X x Y —Z is bilinear and separately contin- 
uous, X is an F-space, and Y and Z are topological vector spaces. Then 


(1) B(x, Yn) aa B(xo, Yo) in Z 


whenever X, > Xq in X and y, > yo in Y. If Y is metrizable, it follows that B 
is continuous. 


PROOF. Let U and W be neighborhoods of 0 in Z such that 
U+Uc W. Define 


b,(x) = B(x, y,) (X€X7 = 1,2, 3,, s<2). 
Since B is continuous as a function of y, 


lim b,(x) = B(x, Yo) (x € X). 
Thus {b,(x)} is a bounded subset of Z, for each x € X. Since each J, is 
a continuous linear mapping of the F-space X, the Banach-Steinhaus 
theorem 2.6 implies that {b,} is equicontinuous. Hence there is a 
neighborhood V of 0 in X such that 


b(V)cU  (n=1,2,3,...). 
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Note that 
BXy Yn) = B(Xo, Yo) = bX - Xo) + BiXo, Yn — Yo). 


If n is sufficiently large, then (i) x, € x) + V, so that b,(x, — x9) € U, 
and (ii) B(xo, ¥, — Yo) € U, since B is continuous in y and B(x, 0) = 
0. Hence 

B(x, ; Yn) = B(Xo, Yo) eU+UcW 


for all large n. This gives (1). 
If Y is metrizable, so is X x Y, and the continuity of B then 
follows from (1). (See Appendix A6.) Ei} 


Exercises 


1. If X is an infinite-dimensional topological vector space which is the union of 
countably many finite-dimensional subspaces, prove that X is of the first cate- 
gory in itself. Prove that therefore no infinite-dimensional F-space has a count- 
able Hamel basis. 

(A set B is a Hamel basis for a vector space X if B is a maximal linearly 
independent subset of X. Alternatively, B is a Hamel basis if every x € X has a 
unique representation as a finite linear combination of elements of £.) 

2. Sets of first and second category are “small” and “large” in a topological sense. 
These notions are different when “small” and “large” are understood in the 
sense of measure, even when the measure is intimately related to the topology. 
To see this, construct a subset of the unit interval which is of the first category 
but whose Lebesgue measure is 1. 

3. Put K =[-—1, 1]; define D, as in Section 1.46 (with R in place of R"). Suppose 
{ f,} is a sequence of Lebesgue integrable functions such that 


Ad = lim | | f(t\o(t) dt 


no 


exists for every ¢ € D,. Show that A is a continuous linear functional on Dx. 
Show that there is a positive integer p and a number M < o such that 


1 
| SAlt)P(t) dt} < M||D?||,, 
-1 
for all n. For example, if f(t) = n°t on [—1/n, 1/n] and 0 elsewhere, show that 
this can be done with p = 1. Construct an example where it can be done with 
p = 2 but not with p = 1. 
4. Let L’ and LV’ be the usual Lebesgue spaces on the unit interval. Prove that L’ is 
of the first category in L’, in three ways: 
(a) Show that {f: { | f |? <n} is closed in L’ but has empty interior. 
(b) Put g, =n on [0, n~ *], and show that 


| J0,-+0 


for every fe L? but not for every fe L’. 
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(c) Note that the inclusion map of LP’ into L’ is continuous but not onto. 
Do the same for I? and [7 if p < q. 


. Prove results analogous to those of Exercise 4 for the spaces 7”, where 7? is the 


Banach space of all complex functions x on {0, 1, 2,...} whose norm 


oo 1/p 
x1, = » x 


is finite. 


. Define the Fourier coefficients f(n) of a function f € 2(T) (T is the unit circle) by 


‘ |e ea ; 
fy =~ { fee" d0 


for all n € Z (the integers). Put 
A, f= ¥ fl. 
k=-n 


Prove that {fe L?(T): lim,.,, A, f exists} is a dense subspace of 1?(T) of the 
first category. 


. Let C(T) be the set of all continuous complex functions on the unit circle T. 


Suppose {y,} (n € Z) is a complex sequence that associates to each fe C(T) a 
function A f € C(T) whose Fourier coefficients are 


(Af)*(n) =>, f(r) (ne Z). 


(The notation is as in Exercise 6.) Prove that {y,} has this multiplier property if 
and only if there is a complex Borel measure yp on T such that 


Yn = | e™ du(0) (ne Z). 


Suggestion: With the supremum norm, C(T) is a Banach space. Apply the 
closed graph theorem. Then consider the functional 


F>(APKD = Ln f(n) 


and apply the Riesz representation theorem ([23], Th. 6.19). (The above series 
may not converge; use it only for trigonometric polynomials.) 


. Define functionals A,, on 7? (see Exercise 5) by 


A,,x = 3 n*x(n) (m=1, 2, 3,..,). 


Define x, € 7” by x,(n) = 1/n, x,(i) = 0 if i #n. Let K < @? consist of 0, x1, x, 
X3,.... Prove that K is compact. Compute A,, x,,. Show that {A,, x} is bounded 
for each x € K but {A,,x,,} is not. Convexity can therefore not be omitted from 
the hypotheses of Theorem 2.9. 


10. 


11. 


12. 


13. 


14. 


15. 


16. 
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Choose c, > 0 so that }\ c, = 1, }) nc, = 0. Take x = }\ c, x,. Show that 
x lies in the closed convex hull of K (by definition, this is the closure of the 
convex hull) and that {A,,x} is not bounded. 

Show that the convex hull of K is not closed. 


. Suppose X, Y, Z are Banach spaces and 


B:X x YZ 
is bilinear and continuous. Prove that there exists M < oo such that 
|| B(x, y)] < M||x|}llyll (xe X,ye Y). 


Is completeness needed here? 

Prove that a bilinear mapping is continuous if it is continuous at the origin 
(0, 0). 

Define B(x,, x2; y) = (x,y, X2 y). Show that B is a bilinear continuous mapping 
of R? x R onto R? which is not open at (1, 1; 0). Find all points where this B is 
open. 

Let X be the normed space of all real polynomials in one variable, with 


fl -| | f(t)| dt. 


Put BCS, 9) =f > f(t)g(t) dt, and show that B is a bilinear functional on X x X 
which is separately continuous but is not continuous. 

Suppose X is a topological vector space which is of the second category in itself. 
Let K be a closed, convex, absorbing subset of X. Prove that K contains a 
neighborhood of 0. 

Suggestion: Show first that H = K 7 (—K) is absorbing. By a category argu- 
ment, H has interior. Then use 


2H=HA+H=H — di. 


Show that the result is false without convexity of K, even if X = R*. Show that 

the result is false if X is L? topologized by the L'-norm (as in Exercise 4). 

(a) Suppose X and Y are topological vector spaces, {A,} is an equicontinuous 
sequence of linear mappings of X into Y, and C is the set of all x at which 
{A,(x)} is a Cauchy sequence in Y. Prove that C is a closed subspace of X. 

(b) Assume, in addition to the hypotheses of (a), that Y is an F-space and that 
{A,(x)} converges in some dense subset of X. Prove that then 

A(x) = lim A,(x) 
exists for every x € X and that A is continuous. 

Suppose X is an F-space and Y is a subspace of X whose complement is of the 

first category. Prove that Y = X. Hint: Y must intersect x + Y for every x € X. 

Suppose that X and K are metric spaces, that K is compact, and that the graph 

of f: X > K is a closed subset of X x K. Prove that fis continuous. (This is an 

analogue of Theorem 2.15 but is much easier.) Show that compactness of K 

cannot be omitted from the hypotheses, even when X is compact. 


CHAPTER 
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CONVEXITY 


This chapter deals primarily (though not exclusively) with the most impor- 
tant class of topological vector spaces, namely, the locally convex ones. The 
highlights, from the theoretical as well as the applied standpoints, are 
(a) the Hahn-Banach theorems (assuring a supply of continuous linear func- 
tionals that is adequate for a highly developed duality theory), (b) the 
Banach-Alaoglu compactness theorem in dual spaces, and (c) the Krein- 
Milman theorem about extreme points. Applications to various problems in 
analysis are postponed to Chapter 5. 


The Hahn-Banach Theorems 


The plural is used here because the term “Hahn-Banach theorem” 1s cus- 
tomarily applied to several closely related results. Among these are the 
dominated extension theorems 3.2 and 3.3 (in which no topology is involved), 
the separation theorem 3.4, and the continuous extension theorem 3.6. 
Another separation theorem (which implies 3.4) is stated as Exercise 3. 


3.1 Definitions The dual space of a topological vector space X is the 
vector space X* whose elements are the continuous linear functionals on X. 
Note that addition and scalar multiplication are defined in X* by 


(A, + A,)x = A,x + Ax, (aA)x =a- Ax. 


It is clear that these operations do indeed make X* into a vector space. 
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It will be necessary to use the obvious fact that every complex vector 
space is also a real vector space, and it will be convenient to use the follow- 
ing (temporary) terminology: An additive functional A on a complex vector 
space X is called real-linear (complex-linear) if A(ax) = aAx for every x € X 
and for every real (complex) scalar «. Our standing rule that any statement 
about vector spaces in which no scalar field is mentioned applies to both 
cases 1s unaffected by this temporary terminology and is still in force. 

If u is the real part of a complex-linear functional f on X, then u is 
real-linear and 


(1) F(x) = u(x) — infix) = (x € X) 


because z = Re z — i Re (iz) for every z € ¢. 

Conversely, if u: X — R is real-linear on a complex vector space X 
and if f is defined by (1), a straightforward computation shows that f is 
complex-linear. 

Suppose now that X is a complex topological vector space. The above 
facts imply that a complex-linear functional on X is in X* if and only if its 
real part is continuous, and that every continuous real-linear u: X — R is 
the real part of a unique f € X*. 


3.2 Theorem Suppose 


(a) M isasubspace of areal vector space X, 
(b) p: X >R satisfies 


P(x + y) < P(x) + Ply) = and — pf tx) = p(x) 
ifxe X,ye X,t>0), 
(c) ff: M- Ris linear and f(x) < p(x) on M. 


Then there exists a linear A: X —> R such that 
Ax = f(x) (x € M) 
and 


— p(—x) < Ax < p(x) (x € X). 


PROOF. If M 4 X, choose x, € X, x, ¢ M, and define 
M,=({x+tx,;:xeM,te R}. 
It is clear that M, is a vector space. Since 


F(x) + f(y) =f(% + y) S P(x + ¥) S P(x — x1) + P(X + Y), 
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we have 
(1) F(x) — PX — x1) S Py +xi)-f(y)  — (% ye M). 
Let « be the least upper bound of the left side of (1), as x ranges over 
M. Then 
(2) f(x) -aS p(x—%x,) (xeM) 
and 
(3) f(y)+a<sply+x) (ye M). 
Define f, on M, by 
(4) F(x + tx,) =f (x) + ta (x e M, te R). 


Then f, =f on M, and f, is linear on M,. 

Take t > 0, replace x by t~!x in (2), replace y by t~ 'y in (3), and 
multiply the resulting inequalities by t. In combination with (4), this 
proves that f, < pon M,. 

The second part of the proof can be done by whatever one’s 
favorite method of transfinite induction is; one can use well-ordering, 
or Zorn’s lemma, or Hausdorff’s maximality theorem. 

Let F be the collection of all ordered pairs (M’, f’), where M’ is 
a subspace of X that contains M and f” is a linear functional on M’ 
that extends f and satisfies f’ < p on M’. Partially order F by declar- 
ing (M’, f’) < (M", f”) to mean that M’ c M” and f” = f’ on M’. By 
Hausdorff’s maximality theorem there exists a maximal totally 
ordered subcollection Q of 7. 

Let ® be the collection of all M’ such that (M’, f’) e Q. Then ® is 
totally ordered by set inclusion, and the union M of all members of ® 
is therefore a subspace of X. If x e M then x e M’ for some M’ e€ O; 
define Ax = f’(x), where f’ is the function which occurs in the pair 
(M',f) €Q. 

It is now easy to check that A is well defined on M, that A is 
linear, and that A < p. If M were a proper subspace of X, the first 
part of the proof would give a further extension of A, and this would 
contradict the maximality of Q. Thus M = X. 

Finally, the inequality A < p implies that 


—P(—x) < —A(—x) = Ax 
for all x ¢ X. This completes the proof. Vai 


3.3. Theorem Suppose M is a subspace of a vector space X, p is a semi- 
norm on X, and f is a linear functional on M such that 


If)|< px) (eM). 
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Then f extends to a linear functional A on X that satisfies 


| Ax | < p(x) (x € X). 


PROOF. If the scalar field is R, this is contained in Theorem 3.2, since p 
now Satisfies p(—x) = p(x). 

Assume that the scalar field is ¢. Put u = Re f’ By Theorem 3.2 
there is a real-linear U on X such that U =u on M and U < pon X. 
Let A be the complex-linear functional on X whose real part is U. The 
discussion in Section 3.1 implies that A = fon M. 

Finally, to every x € X corresponds an « € ¢, |a«| = 1, such that 
aAx = |Ax]|. Hence 


| Ax | = A(ax) = U(ax) < p(ax) = p(x). /[//1 
Corollary. If X is a normed space and x, € X, there exists A € X* such 
that 
AXo = ||Xol and |Ax|< |x} forall x e X. 
PROOF. If x9 = 0, take A=0. If x) #0, apply Theorem 3.3, with 


p(x) = ||x||, M the one-dimensional space generated by xX,, and 
f (4X9) = a||xo|| on M. [ff 


3.4 Theorem Suppose A and B are disjoint, nonempty, convex sets in a 
topological vector space X. 
(a) If A is open there exist A € X* and y € R such that 

Re Ax <y < Re Ay 


for every x € A and for every y € B. 


(b) If A is compact, B is closed, and X is locally convex, then there exist 
Ae X*,y, € R, y2 € R, such that 


Re Ax < y; < y2 < Re Ay 
for every x € A and for every y € B. 


Note that this is stated without specifying the scalar field; if it is R, 
then Re A = A, of course. 


PROOF. It is enough to prove this for real scalars. For if the scalar field 
is ¢ and the real case has been proved, then there is a continuous 
real-linear A, on X that gives the required separation; if A is the 
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unique complex-linear functional on X whose real part is A,, then 
A € X*. (See Section 3.1.) Assume real scalars. 

(a) Fix Ap € A, bp € B. Put Xp = bp — ag; put C= A-— B+ Xp. 
Then C is a convex neighborhood of 0 in X. Let p be the Minkowski 
functional of C. By Theorem 1.35, p satisfies hypothesis (b) of 
Theorem 3.2. Since A (1 B= @, Xo ¢ C, and so p(xo) > 1. 

Define f(tx,) = t on the subspace M of X generated by xp. If 
t > 0 then 


f(tXo) = t < tp(Xo) = P(tXo); 


if t < O then f(tx9) < 0 < p(tx,). Thus f < p on M. By Theorem 3.2, f 
extends to a linear functional A on X that also satisfies A < p. In 
particular, A < 1 on C, hence A > —1 on —C, so that |A| < 1 on the 
neighborhood C m (—C) of 0. By Theorem 1.18, A € X*. 

If now a € A and b ec B, we have 


Aa — Ab+1=A(a—b+ x9) < p(a— b+ X9) < 1 


since Axy = 1,a— b+ xX, € C, and C is open. Thus Aa < Ab. 

It follows that A(A) and A(B) are disjoint convex subsets of R, 
with A(A) to the left of A(B). Also, A(A) is an open set since A is open 
and since every nonconstant linear functional on X is an open 
mapping. Let y be the right end point of A(A) to get the conclusion of 
part (a). 

(b) By Theorem 1.10 there is a convex neighborhood V of 0 in 
X such that (4 + V) a B= @©. Part (a), with A + V in place of A, 
shows that there exists A € X* such that A(A + V) and A(B) are dis- 
joint convex subsets of R, with A(A + V) open and to the left of A(B). 
Since A(A) is a compact subset of A(A + V), we obtain the conclusion 


of (b). //// 


Corollary. If X is a locally convex space then X* separates points on 
Xx. 


PROOF. If x, € X, x, € X, and x, # x,, apply (b) of Theorem 3.4 with 
A= hs ey). //1/ 


3.5 Theorem Suppose M is a subspace of a locally convex space X, and 
Xo € X. If Xo is not in the closure of M, then there exists A € X* such that 
AXo = 1 but Ax = 0 for every x € M. 


PROOF. By (b) of Theorem 3.4, with A = {x,} and B = M, there exists 
A € X* such that Ax, and A(M) are disjoint. Thus A(M) is a proper 
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subspace of the scalar field. This forces A(M) = {0} and Ax, # 0. The 
desired functional is obtained by dividing A by Axo. Ji) 


Remark. This theorem is the basis of a standard method of treating 
certain approximation problems: In order to prove that an x, € X lies 
in the closure of some subspace M of X it suffices (if X is locally 
convex) to show that Ax, = 0 for every continuous linear functional 
A on X that vanishes on M. 


3.6 Theorem If fis a continuous linear functional on a subspace M of a 
locally convex space X, then there exists A € X* such that A = fon M. 


Remark. For normed spaces this is an immediate corollary of 
Theorem 3.3. The general case could also be obtained from 3.3, by 
relating the continuity of linear functionals to seminorms (see Exercise 
8, Chapter 1). The proof given below shows that Theorem 3.6 depends 
only on the separation property of Theorem 3.5. 


PROOF. Assume, without loss of generality, that fis not identically 0 
on M. Put 


My = {xe M: f(x) = 0} 


and pick x, € M such that f(x.) = 1. Since fis continuous, x, is not in 
the M-closure of My, and since M inherits its topology from X, it 
follows that x, is not in the X-closure of My. 

Theorem 3.5 therefore assures the existence of a A € X* such 
that Ax)» = land A =0OonM,. 

If x e M, then x — f(x)xy € Mo, since f(x.) = 1. Hence 


Ax — f(x) = Ax —f(x)AxX9 = A(x — f(x)Xo9) = 0. 
Thus A = fon M. [// 


We conclude this discussion with another useful corollary of the 
separation theorem. 


3.7 Theorem Suppose B is a convex, balanced, closed set in a locally 
convex space X, X,» € X, but x9 ¢ B. Then there exists A € X* such that 
|Ax| < 1 for all x € B, but Ax, > 1. 


PROOF. Since B is closed and convex, we can apply (b) of Theorem 3.4, 
with A = {x}, to obtain A, € X* such that A,x9 = re” lies outside 
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the closure K of A,(B). Since B is balanced, so is K. Hence there 
exists s, O<s<r, so that |z|<s for all ze K. The functional 
A =s ‘'e~A, has the desired properties. //// 


Weak Topologies 


3.8 Topological preliminaries The purpose of this section is to explain 
and illustrate some of the phenomena that occur when a set is topologized 
in several ways. 

Let t, and t, be two topologies on a set X, and assume 7, < T,; that 
is, every T,-open set is also t,-open. Then we say that t, is weaker than t,, 
or that t, is stronger than t,. [Note that (in accordance with the meaning of 
the inclusion symbol c) the terms “weaker” and “stronger” do not 
exclude equality.] In this situation, the identity mapping on X is continuous 
from (X, t,) to (X, t,) and is an open mapping from (X, T,) to (X, T,). 

As a first illustration, let us prove that the topology of a compact 
Hausdorff space has a certain rigidity, in the sense that it cannot be 
weakened without losing the Hausdorff separation axiom and cannot be 
strengthened without losing compactness: 


(a) Ift, < Tt, are topologies on a set X, if t, is a Hausdorff topology, and if 
T, is compact, then t, = T2. 


To see this, let F < X be 1,-closed. Since X is t,-compact, so is F. 
Since T, € Tz, it follows that F is t,-compact. (Every t,-open cover of F is 
also a t,-open cover.) Since t, is a Hausdorff topology, it follows that F is 
t,-closed. 

As another illustration, consider the quotient topology ty of X/N, as 
defined in Section 1.40, and the quotient map 2: X > X/N. By its very 
definition, ty 1s the strongest topology on X/N that makes z continuous, 
and it is the weakest one that makes x an open mapping. Explicitly, if t’ 
and t” are topologies on X/N, and if z is continuous relative to t’ and open 
relative to t”, then tT Cty ct”. 

Suppose next that X is a set and F is a nonempty family of mappings 
f: X + Y,, where each Y, is a topological space. (In many important cases, 
Y, is the same for all fe F#.) Let t be the collection of all unions of finite 
intersections of sets f~*(V), with fe F and V open in Y,. Then t is a 
topology on X, and it is in fact the weakest topology on X that makes every 
fe¢¥F continuous: If t’ is any other topology with that property, then 
tov’. This t is called the weak topology on X induced by ¥, or, more 
succinctly, the ¥-topology of X. 

The best-known example of this situation is undoubtedly the usual 
way in which one topologizes the cartesian product X of a collection of 
topological spaces X,. If z,(x) denotes the ath coordinate of a point x € X, 
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then z, maps X onto X,, and the product topology t of X is, by definition, 
its {7,}-topology, the weakest one that makes every z, continuous. Assume 
now that every X, 1s a compact Hausdorff space. Then t is a compact topol- 
ogy on X (by Tychonoff’s theorem), and proposition (a) implies that t 
cannot be strengthened without spoiling Tychonoff’s theorem. 

In the last sentence a special case of the following proposition was 
tacitly used: 


(b) If F isa family of mappings f: X — Y,, where X is a set and each Y, is 
a Hausdorff space, and if F¥ separates points on X, then the F¥-topology 
of X is a Hausdorff topology. 


For if p#q are points of X, then f(p) # f(q) for some fe ¥; the 
points f(p) and f(q) have disjoint neighborhoods in Y, whose inverse images 
under f are open (by definition) and disjoint. 

Here is an application of these ideas to a metrization theorem. 


(c) If X is a compact topological space and if some sequence { f,} of contin- 
uous real-valued functions separates points on X, then X is metrizable. 


Let t be the given topology of X. Suppose, without loss of generality, 
that | f,| <1 for all n, and let t, be the topology induced on X by the 
metric 


lp, 4) = y 2-*| £9) f(a) |. 


This is indeed a metric, since {f,} separates points. Since each f, is 
t-continuous and the series converges uniformly on X x X, d is a 
t-continuous function on X x X. The balls 


Bp) = {q € X: d(p, q) <r} 


are therefore t-open. Thus t, < t. Since t, is induced by a metric, ty 18 a 
Hausdorff topology, and now (a) implies that t = Ty. 

The following lemma has applications in the study of vector topol- 
ogies. In fact, the case n= 1 was needed (and proved) at the end of 
Theorem 3.6. 


3.9 Lemma Suppose A,,..., A, and A are linear functionals on a vector 
space X. Let 


Neier Aggy SS A,X 0}. 


The following three properties are then equivalent : 


(a) There are scalars «,,..., a, such that 


A=a,A,+-°°:+4,A,. 
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(b) There exists y < 00 such that 


(c) 


| Ax|<y max |A;x| (x € X). 


l<i<n 


Ax = 0 for everyx EN. 


PROOF. It is clear that (a) implies (b) and that (b) implies (c). Assume (c) 
holds. Let ® be the scalar field. Define 1: X — ®" by 


mx) = (A,x, ..., A,X). 


If x(x) = n(x’), then (c) implies Ax = Ax’. Hence f(x(x)) = Ax defines a 
linear functional f on 2(X). Extend f to a linear functional F on ®". 
This means that there exist a, € ® such that 


F(u,,..., U,) = @%,U, +°°° + 4,U,- 
Thus 
Ax = F(n(x)) = F(A,x, ..., A,X) = ¥ a A;x, 


i=1 


which is (a). //// 


3.10 Theorem Suppose X is a vector space and X' is a separating vector 
space of linear functionals on X. Then the X'-topology t' makes X into a 
locally convex space whose dual space is X’. 


The assumptions on X’ are, more explicitly, that X’ is closed under 


addition and scalar multiplication and that Ax, # Ax, for some A e€ X’ 
whenever x, and x, are distinct points of X. 


PROOF. Since R and ¢ are Hausdorff spaces, (b) of Section 3.8 shows 
that t’ is a Hausdorff topology. The linearity of the members of X’ 


shows that 1’ is translation-invariant. If A,,..., A, € X’, ifr; > 0, and 
if 
(1) V ={x:|A;x|<r; for 1<i<n}, 


then V is convex, balanced, and V e 1’. In fact, the collection of all V 
of the form (1) is a local base for t’. Thus 1’ is a locally convex topol- 
ogy on X. 

If (1) holds, then $V + 4V =V. This proves that addition is 
continuous. Suppose x € X and « is a scalar. Then x € sV for some 
s>0.If|B—a|<rand y—xerV then 


By — ax =(B — a)y + aly — x) 
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lies in V, provided that r is so small that 
rst+ryt+|al[r<l. 


Hence scalar multiplication is continuous. 

We have now proved that 1’ is a locally convex vector topology. 
Every Ae X’ is t’-continuous. Conversely, suppose A is a t’-con- 
tinuous linear functional on X. Then | Ax| < 1 for all x in some set V 
of the form (1). Condition (b) of Lemma 3.9 therefore holds; hence so 
does (a): A=) a; A;. Since A; € X’ and X’ is a vector space, A € X’. 
This completes the proof. EH 


Note: The first part of this proof could have been based on Theorem 
1.37 and the separating family of seminorms p,(A € X’) given by p,(x) = 
|Ax|. 


3.11. The weak topology of a topological vector space Suppose X is 
a topological vector space (with topology t) whose dual X* separates points 
on X. (We know that this happens in every locally convex X. It also 
happens in some others; see Exercise 5.) The X*-topology of X is called the 
weak topology of X. 

We shall let X, denote X topologized by this weak topology t,,. 
Theorem 3.10 implies that X,, is a locally convex space whose dual is 
also X*. 

Since every A € X* is t-continuous and since t,, is the weakest topol- 
ogy on X with that property, we have t, ct. In this context, the given 
topology t will often be called the original topology of X. 

Self-explanatory expressions such as original neighborhood, weak 
neighborhood, original closure, weak closure, originally bounded, weakly 
bounded, etc., will be used to make it clear with respect to which topology 
these terms are to be understood.’ 

For instance, let {x,} be a sequence in X. To say that x, > 0 originally 
means that every original neighborhood of 0 contains all x, with sufficiently 
large n. To say that x, ~ 0 weakly means that every weak neighborhood of 
0 contains all x, with sufficiently large n. Since every weak neighborhood of 


‘When X is a Fréchet space (hence, in particular, when X is a Banach space) the original 
topology of X is usually called its strong topology. In that context, the terms “strong” and 
“strongly” will be used in place of “original” and “originally.” For locally convex spaces in 
general, the term “strong topology” has been given a specific technical meaning. See [15], pp. 
256-258; also [14], p. 169. It seems therefore advisable to use “original” in the present general 
discussion. 
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0 contains a neighborhood of the form 
(1) V ={x:|A;x| <r; for 1<i<n}, 


where A; € X* and 7; > 0, it is easy to see that x, +0 weakly if and only if 
Ax, 2 0 for every A € X*. 

Hence every originally convergent sequence converges weakly. (The 
converse is usually false; see Exercises 5 and 6.) 

Similarly, a set E < X is weakly bounded (that is, E is a bounded 
subset of X,,) if and only if every V as in (1) contains tE for some 
t = t(V) > 0. This happens if and only if there corresponds to each A € X* 
a number (A) < oo such that | Ax| < y(A) for every x € E. In other words, 
a set EX is weakly bounded if and only if every A € X* is a bounded 
function on E. 

Let V again be as in (1), and put 


N= {xe Ayx a SA, x = 0). 


Since x >(A,x, ..., A,x) maps X into ¢ with null space N, we see 
that dim X <n+dim N. Since N c V, this leads to the following conclu- 
sion. 


If X is infinite-dimensional then every weak neighborhood of 0 contains 
an infinite-dimensional subspace; hence X ,, is not locally bounded. 


This implies in many cases that the weak topology is strictly weaker 
than the original one. Of course, the two may coincide: Theorem 3.10 
implies that (X,,), = X,,. 

We now come to a more interesting result. 


3.12 Theorem Suppose E is a convex subset of a locally convex space X. 
Then the weak closure E.,, of E is equal to its original closure E. 


PROOF. E,, is weakly closed, hence originally closed, so that Ec E,. 
To obtain the opposite inclusion, choose x9 € X, Xo ¢ E. Part (b) of 
the separation theorem 3.4 shows that there exist Ace X* and ye R 
such that, for every x € E, 


Re Ax, < yp < Re Ax. 
The set {x: Re Ax < y} is therefore a weak neighborhood of x, that 
does not intersect E. Thus x, is not in E,.This provesE, cE. — //// 


Corollaries. For convex subsets of a locally convex space, 


(a) originally closed equals weakly closed, and 
(b) originally dense equals weakly dense. 


CHAPTER 3: CONVEXITY 67 


The proofs are obvious. Here is another noteworthy consequence of 
Theorem 3.12. 


3.13 Theorem Suppose X is a metrizable locally convex space. If {x,} is 
a sequence in X that converges weakly to some x € X, then there is a 
sequence {y;\ in X such that 


(a) each y; is a convex combination of finitely many x,,, and 
(b) y;—x originally. 


Conclusion (a) says, more explicitly, that there exist numbers «,, > 0, 
such that 


>» in = 1, Vee > in Xas 
n=1 


and, for each i, only finitely many a,, are +0. 


PROOF. Let H be the convex hull of the set of all x,; let K be the weak 
closure of H. Then x € K. By Theorem 3.12, x is also in the original 
closure of H. Since the original topology of X is assumed to be 
metrizable, it follows that there is a sequence {y,} in H that converges 
originally to x. /// 


To get a feeling for what is involved here, consider the following 
example. 

Let K be a compact Hausdorff space (the unit interval on the real line 
is a sufficiently interesting one), and assume that f and f, (n = 1, 2, 3, ...) are 
continuous complex functions on K such that f(x) > f(x) for every x € K, 
as n> oo, and such that | f,(x)| < 1 for all n and all x e K. Theorem 3.13 
asserts that there are convex combinations of the f, that converge uniformly 
to f. 

To see this, let C(K) be the Banach space of all complex continuous 
functions on K, normed by the supremum. Then strong convergence is the 
same as uniform convergence on K. If yu is any complex Borel measure 
on K, Lebesgue’s dominated convergence theorem implies that | f, du— 
{ fdu. Hence f,>f weakly, by the Riesz representation theorem which 
identifies the dual of C(K) with the space of all regular complex Borel mea- 
sures on K. Now Theorem 3.13 can be applied. 

After this short detour we now return to our main line of develop- 
ment. 


3.14 The weak*-topology of a dual space Let X again be a topologi- 
cal vector space whose dual is X*. For the definitions that follow, it is 
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irrelevant whether X* separates points on X or not. The important obser- 
vation to make is that every x € X induces a linear functional f, on X*, 
defined by 


fA = Ax, 


and that { f,,: x € X} separates points on X*. 

The linearity of each f, is obvious; if f, A =f, A’ for all x e X, then 
Ax = A’x for all x, and so A = A’ by the very definition of what it means 
for two functions to be equal. 

We are now in the situation described by Theorem 3.10, with X* in 
place of X and with X in place of X’. 

The X-topology of X* is called the weak*-topology of X* 
(pronunciation: weak star topology). 

Theorem 3.10 implies that this is a locally convex vector topology on 
X* and that every linear functional on X* that is weak*-continuous has the 
form A — Ax for some x € X. 

The weak*-topologies have a very important compactness property to 
which we now turn our attention. Various pathological features of the 
weak- and weak*-topologies are described in Exercises 9 and 10. 


Compact Convex Sets 


3.15 The Banach-Alaoglu theorem If V is a neighborhood of 0 in a 
topological vector space X and if 


K = {Ae X*:|Ax|<1 forevery xeV} 


then K is weak*-compact. 


Note: K is sometimes called the polar of V. It is clear that K is 
convex and balanced, because this is true of the unit disc in ¢ (and of the 
interval [—1, 1] in R). There is some redundancy in the definition of K, 
since every linear functional on X that is bounded on V is continuous, 
hence is in X*. 


PROOF. Since neighborhoods of 0 are absorbing, there corresponds to 
each x € X a number »(x) < oo such that x € »(x)V. Hence 


(1) | Ax] < y(x) (x e X, Ae K). 


Let D,, be the set of all scalars « such that | «| < y(x). Let t be the 
product topology on P, the cartesian product of all D,, one for each 
x € X. Since each D,. is compact, so is P, by Tychonoff’s theorem. The 
elements of P are the functions fon X (linear or not) that satisfy 


(2) I F(x) <x) (x € X). 
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Thus K c X* «- P. It follows that K inherits two topologies: 
one from X* (its weak*-topology, to which the conclusion of the 
theorem refers) and the other, t, from P. We will see that 


(a) these two topologies coincide on K, and 
(b) K isaclosed subset of P. 


Since P is compact, (b) implies that K is t-compact, and then (a) 
implies that K is weak*-compact. 
Fix some Ay € K. Choose x; € X, for 1 <i<n; choose 6 > 0. 


Put 
(3) W, = {Ae X*: |Ax;-—Apx;|<6 for 1<i<n} 
and 
(4) W,={feP:|f(x)—Aox;|<6 for 1<i<n}. 


Let n, x;, and 6 range over all admissible values. The resulting sets W, 
then form a local base for the weak*-topology of X* at A, and the 
sets W, form a local base for the product topology t of P at Ay. Since 
K cP oo X*, we have 


W,iAK=Wioa K. 


This proves (a). 

Next, suppose fy is in the t-closure of K. Choose x € X, ye X, 
scalars a and Bf, and ¢ > 0. The set of all fe P such that | f—fp | <é 
at x, at y, and at ax + By is a t-neighborhood of f,. Therefore K 
contains such an f. Since this fis linear, we have 


folax + By) — afo(x) — Bfoly) 
= (fo —f ax + By) + af — foXx) + BS — foXy), 
so that 


| folax + By) — afo(x) — Bfoly)| < (+ lal + 1B [e. 


Since € was arbitrary, we see that fp is linear. Finally, if x e V and 
€ > 0, the same argument shows that there is an fe K such that 
| f(x) — fo(x)| < e. Since | f(x)| < 1, by the definition of K, it follows 
that | fo(x)| < 1. We conclude that fo e K. This proves (b) and hence 
the theorem. //// 


When X is separable (i.e., when there is a countable dense set in X), 
then the conclusion of the Banach-Alaoglu theorem can be strengthened by 
combining it with the following fact: 
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3.16 Theorem If X is a separable topological vector space, if K < X*, 
and if K is weak*-compact, then K is metrizable, in the weak*-topology. 


Warning: It does not follow that X* itself is metrizable in its weak*- 
topology. In fact, this is false whenever X is an infinite-dimensional Banach 
space. See Exercise 15. 


PROOF. Let {x,} be a countable dense set in X. Put f,(A) = Ax,, for 
A € X*. Each f, is weak*-continuous, by the definition of the weak*- 
topology. If f,(A) =/,(A’) for all n, then Ax, = A’x, for all n, which 
implies that A = A’, since both are continuous on X and coincide on 
a dense set. 

Thus {f,} is a countable family of continuous functions that 
separates points on X*. The metrizability of K now follows from (c) of 
Section 3.8. /T// 


3.17 Theorem If V is a neighborhood of 0 in a separable topological 
vector space X, and if {A,} is a sequence in X* such that 


|A,x| <1 (xe V,n = 1, 2, 3,...), 
then there is a subsequence {A,,} and there is a A € X* such that 
Ax=limA,,x (xe X). 


i 0 


In other words, the polar of V is sequentially compact in the weak*- 
topology. 


PROOF. Combine Theorems 3.15 and 3.16. //// 


The next application of the Banach-Alaoglu theorem involves the 
Hahn-Banach theorem and a category argument. 


3.18 Theorem In a locally convex space X, every weakly bounded set is 
originally bounded, and vice versa. 


Part (d) of Exercise 5 shows that the local convexity of X cannot be 
omitted from the hypotheses. 


PROOF. Since every weak neighborhood of 0 in X is an original neigh- 
borhood of 0, it is obvious from the definition of “bounded” that 
every originally bounded subset of X is weakly bounded. The con- 
verse is the nontrivial part of the theorem. 
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Suppose E c X is weakly bounded and U is an original neigh- 
borhood of 0 in X. 

Since X is locally convex, there is a convex, balanced, original 
neighborhood V of 0 in X such that Vc U. Let K c X* be the polar 


of V: 

(1) K = {Ae X*: |Ax| <1 forall xeV}. 
We claim that 

(2) V={xeX:|Ax| <1 forall Ae K}. 


It is clear that V is a subset of the right side of (2) and hence so is V, 
since the right side of (2) is closed. Suppose x, ¢X but x, ¢V. 
Theorem 3.7 (with Vin place of B) then shows that Ax, > 1 for some 
A € K. This proves (2). 

Since E is weakly bounded, there corresponds to each A € X* a 
number y(A) < oo such that 


(3) | Ax | < (A) (x € E). 


Since K is convex and weak*-compact (Theorem 3.15) and since the 
functions A > Ax are weak*-continuous, we can apply Theorem 2.9 
(with X* in place of X and the scalar field in place of Y) to conclude 
from (3) that there is a constant y < oo such that 


(4) |Ax|<y  (xeE,A€ Ky). 

Now (2) and (4) show that y~'x e Vc U for all x € E. Since V is 
balanced, 

(5) EctVctU (t >). 

Thus E 1s originally bounded. //// 


Corollary. If X is a normed space, if E < X, and if 
(6) sup | Ax| < oo (A e€ X*) 


xeE 


then there exists y < oo such that 
(7) Ixl<y (xe 8). 


PROOF. Normed spaces are locally convex; (6) says that E is weakly 
bounded, and (7) says that E is originally bounded. //// 


We now turn to the question: What can one say about the convex 
hull H of a compact set K? Even in a Hilbert space, H need not be closed, 
and there are situations in which H is not compact (Exercises 20, 22). In 
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Fréchet spaces the latter pathology does not occur (Theorem 3.20). The 
proof of this will depend on the fact that a subset of a complete metric 
space is compact if and only if it is closed and totally bounded (Appendix 
A4). 


3.19 Definitions (a) If X is a vector space and E c X, the convex hull 
of E will be denoted by co(E). Recall that co(E) is the intersection of all 
convex subsets of X which contain E. Equivalently, co(E) is the set of all 
finite convex combinations of members of E. 

(b) If X is a topological vector space and E c X, the closed convex 
hull of E, written co(E), is the closure of co(E). 

(c) A subset E of a metric space X is said to be totally bounded if E 
lies in the union of finitely many open balls of radius ¢, for every ¢ > 0. 

The same concept can be defined in any topological vector space, 
metrizable or not: 


(d) A set E in a topological vector space X is said to be totally 
bounded if to every neighborhood V of 0 in X corresponds a finite set F 
such that Ec F+ V. 


If X happens to be a metrizable topological vector space, then these 
two notions of total boundedness coincide, provided we restrict ourselves to 
invariant metrics that are compatible with the topology of X. (The proof of 
this is as in Section 1.25.) 


3.20 Theorem 


(a) If A,,..., A, are compact convex sets in a topological vector space X, 
then co(A, U-:: U A,) is compact. 


(b) If X is a locally convex topological vector space and E c X is totally 
bounded, then co(E) is totally bounded. 


(c) If X isa Fréchet space and K < X is compact, then co(K) is compact. 
(d) If K isacompact set in R", then co(K) is compact. 


PROOF. (a) Let S be the simplex in R" consisting of all s = (sy, ..., s,) 
with s;>0, s,;+°::+s,=1. Put A=A,x-::xA,. Define 
f:SxA-X by 


(1) f(s, a) =s,a, +°:'4+5,4, 


and put K = f(S x A). 
It is clear that K is compact and that K cco(A, U::: U A,). 
We will see that this inclusion is actually an equality. 
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If (s, a) and (t, b) are in S x A and ifa>0, B>0,a+ 8 =1, 
then 


(2) af(s, a) + Bf(t, b) =f(u, c), 
where u = as + Bt € Sand c € A, because 


(3) pac TU gy, Gina 

as; + Bt; 
This shows that K is convex. Since A; < K for each i [take s; = 1 in 
(1), s; =0 for j # i], the convexity of K implies that co(A,; U-:: U 
A,) < K. This proves (a). 

(b) Let U be a neighborhood of 0 in X. Choose a convex neigh- 
borhood V of 0 in X such that V+VcU. Then Ec F+V for 
some finite set F < X. Hence E < co(F) + V. The latter set is convex. 
It follows that 


(4) co(E) < co(F) + V. 


But co(F) is compact [a special case of (a)], and therefore co(F) < 
F, + V for some finite set F, < X. Thus 


(5) cof) c F,+V+VcF,+U. 


Since U was arbitrary, co(E) is totally bounded. 

(c) Closures of totally bounded sets are totally bounded in every 
metric space, and hence are compact in every complete metric space 
(Appendix A4). So if K is compact in a Fréchet space, then K is obvi- 
ously totally bounded; hence co(K) is totally bounded, by (b), and 
therefore co(K) is compact. 

(d) Let S be the simplex in R"** consisting of all t =(t,, ..., 
th+i) With t;>0 and } t;=1. Let K be compact, K < R". By the 
proposition that follows, x € co(K) if and only if 


(6) X= EX, to Ht byt X41 


for some t € S and x; € K (1 <i<n-+ 1). In other words, co(K) is the 
image of S x K"** under the continuous mapping 


(7) (t, X45 -0+) Xpta) bX, Ho Het Xn a 


Hence co(K) is compact. [ff 


Proposition. If E < R" and x € co(E), then x lies in the convex hull of 
some subset of E which contains at most n + 1 points. 


PROOF. It is enough to show that if k>n and x = ){*! t;x; is a 
convex combination of some k + 1 vectors x; € R", then x is actually a 
convex combination of some k of these vectors. 
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Assume, with no loss of generality, that t; > 0 for1 <i<k+1. 
The null space of the linear map 


k+1 k+1 
(8) (ay os sd (Y a; Xj, 5 ai) 
1 1 


which sends R**' into R" x R, has positive dimension, since k > n. 
Hence there exists (a,,..., a4), with some a; # 0, so that ¥° a,x; = 0 
and ) a; =0. Since t; > for all i, there is a constant 4 such that 
| Aa;| <t; for all i and Aa; = t; for at least one j. Setting c; = t; — Aa;, 
we conclude that x = ) c;x; and that at least one c, is 0; note also 
that ¥\ c; = >. t; = 1 and that c; > 0 for alli. //// 


The following analogue of part (b) of the separation theorem 3.4 will 
be used in the proof of the Krein-Milman theorem. 


3.21 Theorem Suppose X is a topological vector space on which X* 
separates points. Suppose A and B are disjoint, nonempty, compact, convex 
sets in X. Then there exists A € X* such that 


(1) sup Re Ax < inf Re Ay. 


xeA yeB 


Note that part of the hypothesis is weaker than in (b) of Theorem 3.4 
(since local convexity of X implies that X* separates points on X); to make 
up for this, it is now assumed that both A and B are compact. 


PROOF. Let X,, be X with its weak topology. The sets A and B are 
evidently compact in X,,. They are also closed in X,, (because X,, is a 
Hausdorff space). Since X,, is locally convex, (b) of Theorem 3.4 can 
be applied to X,, in place of X; it gives us a A € (X,,)* that satisfies 
(1). But we saw in Section 3.11 (as a consequence of Theorem 3.10) 
that (X,,)* = X™*. //// 


3.22 Extreme points Let K be a subset of a vector space X. A non- 
empty set S c K 1s called an extreme set of K if no point of S is an internal 
point of any line interval whose end points are in K, except when both end 
points are in S. Analytically, the condition can be expressed as follows: If 
xe K,yeK,0<t<1l,and 


(1 —t)x + tyeS, 


thenx eSandyesS. 

The extreme points of K are the extreme sets that consist of just one 
point. 

The set of all extreme points of K will be denoted by E(K). 


CHAPTER 3: CONVEXITY 75 


The following two theorems show that under certain conditions E(K) 
is quite a large set. 


3.23 The Krein-Milman theorem Suppose X is a topological vector 
space on which X* separates points. If K is a nonempty compact convex Set in 
X, then K is the closed convex hull of the set of its extreme points. 

In symbols, K = Co(E(K)). 


PROOF. Let FY be the collection of all compact extreme sets of K. Since 
K € 7, PP # SM. We shall use the following two properties of F: 


(a) The intersection S of any nonempty subcollection of P is a member 
of 7, unless S = SO. 
(b) IfSEP,A € X*, wis the maximum of Re A on S, and 


S, = {x e S: Re Ax = 1}, 
then S, € F. 


The proof of (a) is immediate. To prove (b), suppose tx + 
(i—ty=zeS,,xeK, yeK,0<t< 1. SincezeS and Se FY, we 
have x € S and y € S. Hence Re Ax < p, Re Ay < wu. Since Re Az = ps 
and A is linear, we conclude: Re Ax = w = Re Ay. Hence x € S, and 
y € S,. This proves (5). 

Choose some S € Y. Let # be the collection of all members of 
Ff that are subsets of S. Since S c 7’, F is not empty. Partially order 
Ff by set inclusion, let Q be a maximal totally ordered subcollection 
of F’, and let M be the intersection of all members of 2. Since Q is 
a collection of compact sets with the finite intersection property, 
M # ©. By (a), M € #.. The maximality of Q implies that no proper 
subset of M belongs to #. It now follows from (b) that every A e¢ X* 
is constant on M. Since X* separates points on X, M has only one 
point. Therefore M is an extreme point of K. 

We have now proved that 


(1) EK) \AS#B 


for every S € ZY. In other words, every compact extreme set of K con- 
tains on extreme point of K. 

Since K is compact and convex (the assumed convexity of K will 
now be used for the first time), we have 


(2) CO(E(K)) < K 


and this shows that co(E(K)) is compact. 
Assume, to reach a contradiction, that some x, €K is not 
in CO(E(K)). Theorem 3.21 furnishes then a Ae X* such that 
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Re Ax < Re Axo for every x € co(E(K)). If K, is defined as in (5), then 
K, € #. Our choice of A shows that K, is disjoint from co(E(K)), and 
this contradicts (1). //// 


Remark. The convexity of K was used only to show that co(E(K)) is 
compact. If X were assumed to be locally convex, the compactness of 
co(E(K)) would not be needed, since one could use (b) of Theorem 3.4 
in place of Theorem 3.21. The above argument proves then that K c 
CO(E(K)). The following version of the Krein-Milman theorem is thus 
obtained: 


3.24 Theorem If K is a compact subset of a locally convex space then 
K c COo(E(K)). 


Equivalently, co(K) = Co(E(K)). 
It may happen in this situation that co(K) has extreme points which 


are not in K. (See Exercise 33.) The next theorem shows that this pathology 
cannot occur if Co(K) is compact. Therefore it occurs in no Fréchet space, 
by (c) of Theorem 3.20. 


3.25 Milman’s theorem If K is a compact set in a locally convex space 
X, and if co(K) is also compact, then every extreme point of co(K) lies in K. 


PROOF. Assume that some extreme point p of co(K) is not in K. Then 
there is a convex balanced neighborhood V of 0 in X such that 


(1) (D+VAK=@. 
Choose x,,..., x, in K so that K c |)" (x; + V). Each set 
(2) A;=CoK mn (x,+V)) (lsisn) 


is convex and also compact, since A; < co(K). Also, K < A, U-+: U 
A,,. Part (a) of Theorem 3.20 shows therefore that 


(3) CO(K) < Co(A, U-+:: U A,) = co(A, U-:: U A,). 
But the opposite inclusion holds also, because A; < co(K) for each i. 
Thus 
(4) co(K) = co(A, U-:: U A,). 

In particular, p = t;y, + -:: +tyyy, Where each y, lies in some 
A,, each t; is positive, and >, t; = 1. The grouping 


t>y2 + °°’ ttn yn 


5 =tiy,+(U-t 
(5) Pp Vi + ( 1) reer ae 
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exhibits p as a convex combination of two points of co(K), by (4). 
Since p is an extreme point of co(K), we conclude from (5) that y, = p. 
Thus, for some i, 


(6) peA,cx,+VoK+V, 


which contradicts (1). [Note that A; < x; + V, by (2), because V is 
convex. | Lit 


Vector-Valued Integration 


Sometimes it is desirable to be able to integrate functions f that are defined 
on some measure space Q (with a real or complex measure pv) and whose 
values lie in some topological vector space X. The first problem is to associ- 
ate with these data a vector in X that deserves to be called 


| fan 
Q 


1.e., which has at least some of the properties that integrals usually have. 
For instance, the equation 


(| rau) = [oan du 
Q Q 


ought to hold for every A € X*, because it does hold for sums, and because 
integrals are (or ought to be) limits of sums in some sense or other. In fact, 
our definition will be based on this single requirement. 

Many other approaches to vector-valued integration have been 
studied in great detail; in some of these, the integrals are defined more 
directly as limits of sums (see Exercise 23). 


3.26 Definition Suppose p is a measure on a measure space Q, X is a 
topological vector space on which X* separates points, and f is a function 
from Q into X such that the scalar functions Af are integrable with respect 
to p, for every A € X*; note that Afis defined by 


(1) (ASQ) = AY(@) = (a € Q). 


If there exists a vector y € X such that 


(2) Ky = joy ) du 


for every A € X*, then we define 


(3) | fau=y 
Q 


78 PART I: GENERAL THEORY 


Remarks. It is clear that there is at most one such y, because X* 
separates points on X. Thus there is no uniqueness problem. 

Existence will be proved only in the rather special case (sufficient 
for many applications) in which Q is compact and f is continuous. In 
that case, f(Q) is compact, and the only other requirement that will be 
imposed is that the closed convex hull of f{(Q) should be compact. By 
Theorem 3.20, this additional requirement is automatically satisfied 
when X is a Fréchet space. 


Recall that a Borel measure on a compact (or locally compact) Haus- 
dorff space Q is a measure defined on the o-algebra of all Borel sets in Q; 
this is the smallest o-algebra that contains all open subsets of Q. A probabil- 
ity measure is a positive measure of total mass 1. 


3.27 Theorem Suppose 


(a) X isa topological vector space on which X* separates points, and 
(b) isa Borel probability measure on a compact Hausdorff space Q. 


If f: Q@—X is continuous and if co(f(Q)) is compact in X, then the 
integral 


(1) y= | au 
Q 


exists, in the sense of Definition 3.26. 
Moreover, y € co(f (Q)). 


Remark. If v is any positive Borel measure on Q, then some scalar 
multiple of v is a probability measure. The theorem therefore holds 
(except for its last sentence) with v in place of uw. It can then be 
extended to real-valued Borel measures (by the Jordan decomposition 
theorem) and (if the scalar field of X is ¢) to complex ones. 


Exercise 24 gives another generalization. 


PROOF. Regard X as a real vector space. Put H = co( f(Q)). We have 
to prove that there exists y € H such that 


(2) Ay = | (Af) du 
Q 


for every A € X*. 

Let L = {A,,..., A,} be a finite subset of X*. Let E, be the set 
of all y e H that satisfy (2) for every A € L. Each E, is closed (by the 
continuity of A) and is therefore compact, since H is compact. If no E;, 
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is empty, the collection of all E, has the finite intersection property. 
The intersection of all E, is therefore not empty, and any y in it 
satisfies (2) for every A € X*. It is therefore enough to prove E, # @. 

Regard L = (A,,..., A,) aS a mapping from X into R", and put 
K = L(f(Q)). Define 


(3) m= [OAD du (1 <i<n). 
Q 


We claim that the point m = (m,,..., m,) lies in the convex hull of K. 

If t = (t,,..., t,) € R” is not in this hull, then [by Theorem 3.20 
and (b) of Theorem 3.4 and the known form of the linear functionals 
on R"] there are real numbers c,,..., c, such that 


(4) y C;u; < > Cb; 
ifu = (u,,..., u,) € K. Hence 
(5) Yers@< Yet @eQ). 


Since p is a probability measure, integration of the left side of (5) gives 
Y cpm; < ¥. c;t;. Thus t # m. 

This shows that m lies in the convex hull of K. Since 
K = L(f(Q)) and L is linear, it follows that m = Ly for some y in the 
convex hull H of f(Q). For this y we have 


(6) Ay=m= [ (A) du (I <i<n). 
Q 


Hence y € E,. This completes the proof. [1/1 


3.28 Theorem Suppose 


(a) X isa topological vector space on which X* separates points, 


Q is acompact subset of X, and 
the closed convex hull H of Q is compact. 


Then y € H if and only if there is a regular Borel probability measure pu 


on Q such that 


y= | x d(x). 
Q 


Remarks. The integral is to be understood as in Definition 3.26, with 


f(x) = x. 
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Recall that a positive Borel measure on Q is said to be regular if 
(2) H(E) = sup {u(K): K c E} = inf {u(G): E < G} 


for every Borel set E < Q, where K ranges over the compact subsets 
of E and G ranges over the open supersets of E. 

The integral (1) represents every y € H as a “weighted average” 
of Q, or as the “center of mass” of a certain unit mass distributed 
over Q. 

We stress once more that (c) follows from (b) if X is a Fréchet 
space. 


PROOF. Regard X again as a real vector space. Let C(Q) be the Banach 
space of all real continuous functions on Q, with the supremum norm. 
The Riesz representation theorem identifies the dual space C(Q)* with 
the space of all real Borel measures on Q that are differences of 
regular positive ones. With this identification in mind, we define a 


mapping 


(3) gp: C(Q)* > xX 
by 
(4) P(u) = [> d(x). 


Let P be the set of all regular Borel probability measures on Q. 
The theorem asserts that ¢(P) = H. 

For each x € Q, the unit mass 6, concentrated at x belongs to P. 
Since $(6,) = x, we see that Qc @(P). Since @ is linear and P is 
convex, it follows that H < @(P), where H is the convex hull of Q. By 
Theorem 3.27, ¢(P) < H. Therefore all that remains to be done is to 
show that @(P) is closed in X. 

This is a consequence of the following two facts: 


(i) P is weak*-compact in C(Q)*. 
(ii) The mapping ¢ defined by (4) is continuous if C(Q)* is given its 
weak*-topology and if X is given its weak topology. 


Once we have (i) and (ii), it follows that @(P) is weakly compact, 
hence weakly closed, and since weakly closed sets are strongly closed, 
we have the desired conclusion. 

To prove (i), note that 


(5) Pe yu { du) <i hi <1| 
Q 


and that this larger set is weak*-compact, by the Banach-Alaoglu 
theorem. It is therefore enough to show that P is weak*-closed. 
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If h e C(Q) and h > O, put 


(6) E, = \H [i du > of 
Q 


Since yp — | h du is continuous, by the definition of the weak*- 
topology, each E, is weak*-closed. So is the set 


(7) b= fu: | 1du=1} 
Q 


Since P is the intersection of E and the sets E,, P is weak*-closed. 

To prove (ii) it is enough to prove that ¢ is continuous at the 
origin, since @ is linear. Every weak neighborhood of 0 in X contains 
a set of the form 


(8) W={yeX:|A;y|<r, for 1<i<n}, 
where A; € X* and r; > 0. The restrictions of the A, to Q lie in C(Q). 


Hence 
(9) V= \! Ee C(Q)*: [A dy 


is a weak*-neighborhood of 0 in C(Q)*. But 


<r; for I sisal 


(10) | A; du = af | x aus) = A; 9(u), 

Q Q 
by Definition 3.26. It follows from (8), (9), and (10) that @(V) < W. 
Hence ¢ is continuous. //// 


The following simple inequality sharpens the last assertion in the 
statement of Theorem 3.27. 


3.29 Theorem: Suppose Q is a compact Hausdorff space, X is a Banach 
space, f : Q > X is continuous, and p is a positive Borel measure on Q. Then 


| saul < [is dy. 
Q Q 


PROOF. Put y=] fdyu. By the corollary to Theorem 3.3, there is a 
A € X* such that Ay = ||y|| and | Ax| < ||x|| for all x e X. In particu- 
lar, 


| Af(s)| < I F()II 
for all s € Q. By Theorem 3.27, it follows that 


yl = Ay = jan dus jun dy. //// 
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Holomorphic Functions 


In the study of Banach algebras, as well as in some other contexts, it is 
useful to enlarge the concept of holomorphic function from complex-valued 
ones to vector-valued ones. (Of course, one can also generalize the domains, 
by going from ¢ to ¢” and even beyond. But this is another story.) There 
are at least two very natural definitions of “holomorphic” available in this 
general setting, a “weak” one and a “strong” one. They turn out to define 
the same class of functions if the values are assumed to lie in a Fréchet 
space. 


3.30 Definition Let Q be an open set in ¢ and let X be a complex 
topological vector space. 


(a) A function f:Q—- X is said to be weakly holomorphic in Q if Af is 
holomorphic in the ordinary sense for every A € X*. 


(b) A function f: Q -— X is said to be strongly holomorphic in Q if 
ion LO) SO) 
w—Z 


wz 


exists (in the topology of X) for every z € Q. 


Note that the above quotient is the product of the scalar (w — z)~' 
and the vector f(w) — f(z) in X. 

The continuity of the functionals A that occur in (a) makes it obvious 
that every strongly holomorphic function is weakly holomorphic. The con- 
verse is true when X is a Fréchet space, but it is far from obvious. (Recall 
that weakly convergent sequences may very well fail to converge originally.) 
The Cauchy theorem will play an important role in this proof, as will 
Theorem 3.18. 

The index of a point z € ¢ with respect to a closed path I that does 
not pass through z will be denoted by Ind, (z). We recall that 


1 d 
I 


All paths considered here and later are assumed to be piecewise contin- 
uously differentiable, or at least rectifiable. 


3.31 Theorem Let © be open in ¢, let X be a complex Fréchet space, 
and assume that 


f:Q->~X 


is weakly holomorphic. The following conclusions hold: 


(a) fis strongly continuous in Q. 
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(b) The Cauchy theorem and the Cauchy formula hold: If T is a closed path 
in Q such that Ind; (w) = 0 for every w ¢ Q, then 


(1) | 10 a =o, 

and 

2) f= se [6-9 VO a 
ni Jr 


if z € Q and Ind, (z) = 1. If 1, and TV, are closed paths in Q such that 
Ind;, (w) = Ind;, (w) 


for every w ¢ Q, then 


(3) { f (0) de = | F(C) dC. 


(c) fis strongly holomorphic in Q. 


The integrals in (b) are to be understood in the sense of Theorem 3.27. 
Either one can regard df as a complex measure on the range of I (a 
compact subset of ¢), or one can parametrize I and integrate with respect 
to Lebesgue measure on a compact interval in R. 


PROOF. (a) Assume 0 € Q. We shall prove that fis strongly contin- 
uous at 0. Define 


(4) A, = {zé€¢:|z| <r}. 


Then A,, < Q for some r > 0. Let I be the positively oriented bound- 
ary of A,,. 
Fix A € X*. Since Af is holomorphic, 


(Af\(z) - (AMO) 1 | ANO 


©) 5 om Cae 


if O<|z|<2r. Let M(A) be the maximum of |Af| on A,,. If 
0 <|z|<,r, it follows that 


(6) |z~*ALS() —fO)]| <r M(A). 
The set of all quotients 


{om (z) —f(0) | 


(7) 0<lzIsrt 
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is therefore weakly bounded in X. By Theorem 3.18, this set is also 
strongly bounded. Thus if V is any (strong) neighborhood of 0 in X, 
there exists t < 00 such that 


(8) f(z) —f(O) € z2tV (0 <|z|</r). 


Consequently, f(z) > f(0) strongly, as z > 0. [It may be of some inter- 
est to observe that the proof of (a) used only the local convexity of X. 
Neither metrizability nor completeness has played a role so far. ] 

This was the crux of the matter. The rest is now almost auto- 
matic. 

(b) By (a) and Theorem 3.27, the integrals in (1) to (3) exist. 
These three formulas are correct (by the theory of ordinary holo- 
morphic functions) if f is replaced in them by Af, where A is any 
member of X*. The formulas are therefore correct as stated, by Defi- 
nition 3.26. 

(c) Assume, as in the proof of (a), that A,, < Q, and choose IT 
as in (a). Define 


1 
(9) pase | ora 


The Cauchy formula (2) shows, after a small computation, that 


(10) POALO = y + 22) 


if QO <|z| < 2r, where 


(11) g(z) = ~ [ [2re*(2re’® — z)]~* f(2re’’) dé. 


Let V be a convex balanced neighborhood of 0 in X. Put 
K = {f(Q): |¢| = 2r}. Then K is compact, so that K c tV for some 
t< o.Ifs =tr 7 and |z| <7, it follows that the integrand (11) lies in 
sV for every 6. Thus g(z) € sV if |z| <r. The left side of (10) therefore 
converges strongly to y, as z > 0. //// 


The following extension of Liouville’s theorem concerning bounded 


entire functions does not even depend on Theorem 3.31. It can be used in 
the study of spectra in Banach algebras. (See Exercise 10, Chapter 10.) 


3.32 Theorem Suppose X is a complex topological vector space on which 
X* separates points. Suppose f: € — X is weakly holomorphic and f(€) is a 
weakly bounded subset of X. Then f is constant. 
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PROOF. For every A € X*, Af is a bounded (complex-valued) entire 
function. If z € €, it follows from Liouville’s theorem that 


Af (z) = Af(O). 


Since X* separates points on X, this implies f(z) = f(0), for every 
ze. [/// 


Part (d) of Exercise 5 describes a weakly bounded set which is not 
originally bounded, in an F-space X on which X* separates points. 
Compare with Theorem 3.18. 


Exercises 


1. Call a set H < R" a hyperplane if there exist real numbers a,, ..., a,, c (with 
a; 0 for at least one i) such that H consists of all points x = (x,, ..., x,) that 
satisfy )) a,x; =. 

Suppose E is a convex set in R", with nonempty interior, and y is a 
boundary point of E. Prove that there is a hyperplane H such that y e H and E 
lies entirely on one side of H. (State the conclusion more precisely.) Suggestion: 
Suppose 0 is an interior point of E, let M be the one-dimensional subspace that 
contains y, and apply Theorem 3.2. 

2. Suppose I? = °([—1, 1]), with respect to Lebesgue measure. For each scalar «, 
let E, be the set of all continuous functions f on [—1, 1] such that f(0) = «a. 
Show that each E, is convex and that each is dense in L’. Thus E, and E, are 
disjoint convex sets (if « 4 B) which cannot be separated by any continuous 
linear functional A on [?. Hint: What is A(E,)? 

3. Suppose X is a real vector space (without topology). Call a point x» € A c X an 
internal point of A if A — x, is an absorbing set. 

(a) Suppose A and B are disjoint convex sets in X, and A has an internal point. 
Prove that there is a nonconstant linear functional A on X such that 
A(A) © A(B) contains at most one point. (The proof is similar to that of 
Theorem 3.4.) 

(b) Show (with X = R?, for example) that it may not be possible to have A(A) 
and A(B) disjoint, under the hypotheses of (a). 

4. Let 7” be the space of all real bounded functions x on the positive integers. Let 
t be the translation operator defined on 7” by the equation 


(tx)(n) = x(n + 1) (n = 1, 2, 3,...). 


Prove that there exists a linear functional A on ?@@ (called a Banach limit) such 
that 

(a) Atx = Ax, and 

(b) lim inf x(n) < Ax < lim sup x(n) 


for every x € ¢%. 
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Suggestion: Define 


_ x(1) + +++ + x(n) 
7 n 


A,x 


M = {xe @®: lim A,x = Ax exists} 


n—>@ 


P(x) = lim sup A, x 


n> 


and apply Theorem 3.2. 


. For 0 < p< ~, let #? be the space of all functions x (real or complex, as the 


case may be) on the positive integers, such that 


fe @) 


y° | x(n) |? < 0. 
n=1 
For 1 < p < ©, define ||x||, = {); | x(n) |?}1/”, and define ||x||,, = sup, | x(n)|. 
(a) Assume 1 < p < oo. Prove that ||x||, and ||x||,, make 7? and ¢” into Banach 
spaces. If p~' + q~' = 1, prove that (7”)* = @4, in the following sense: There 
is a one-to-one correspondence A + y between (7?)* and 74, given by 


Ax => x(n)y(n) (x € #”). 


(b) Assume 1 < p< oo and prove that 7? contains sequences that converge 
weakly but not strongly. 

(c) On the other hand, prove that every weakly convergent sequence in 7! con- 
verges strongly, in spite of the fact that the weak topology of 7’ is different 
from its strong topology (which is induced by the norm). 

(d) If 0 < p < 1, prove that 7, metrized by 


d(x, y) = ), | x(n) — y(n)|?, 
n=1 
is a locally bounded F-space which is not locally convex but that (7?)* 
nevertheless separates points on /?. (Thus there are many convex open sets 
in ¢? but not enough to form a base for its topology.) Show that (7?)* = @®, 
in the same sense as in (a). Show also that the set of all x with Z| x(n)| < 1 is 
weakly bounded but not originally bounded. 

(e) For0 <p <1, let t, be the weak*-topology induced on 7” by #?; see (a) and 
(d). If O0<p<r< 1, show that t, and t, are different topologies (is one 
weaker than the other?) but that they induce the same topology on each 
norm-bounded subset of 7°. Hint: The norm-closed unit ball of 7° 1s 
weak*-compact. 


. Put f(t) =e™ (-n<t<n); let 2 =E(—7z, 2), with respect to Lebesgue 


measure. If 1 < p < oo, prove that f, - 0 weakly in I’, but not strongly. 


. L*({0, 1]) has its norm topology (|| f||,, is the essential supremum of | f |) and its 


weak*-topology as the dual of I’. Show that C, the space of all continuous 
functions on [0, 1], is dense in L® in one of these topologies but not in the 
other. (Compare with the corollaries to Theorem 3.12.) Show the same with 
“closed” in place of “dense.” 
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8. Let C be the Banach space of all complex continuous functions on (0, 1], with 


10. 


11. 


the supremum norm. Let B be the closed unit ball of C. Show that there exist 
continuous linear functionals A on C for which A(B) is an open subset of the 
complex plane; in particular, | A| attains no maximum on B. 


. Let E c 2'(—n, 2) be the set of all functions 


te. A(t) a eimt +m ein 


where m, n are integers and 0 < m <n. Let E, be the set of all g e¢ I? such that 

some sequence in E converges weakly to g. (E, is called the weak sequential 

closure of E.) 

(a) Find all g € E,. 

(b) Find all g in the weak closure E,, of E. 

(c) Show that 0 € E,, but 0 is not in E,, although 0 lies in the weak sequential 
closure of E,. 

This example shows that a weak sequential closure need not be weakly 
sequentially closed. The passage from a set to its weak sequential closure is 
therefore not a closure operation, in the sense in which that term is usually used 
in topology. (See also Exercise 28.) 

Represent 7’ as the space of all real functions x on S = {(m, n): m> 1, n> 1}, 
such that 


lll, =X |x(m, n)| < o. 


Let cy be the space of all real functions y on S such that y(m, n)—0 as 
m+n-— oo, with norm ||y||,, = sup | ym, n)|. 

Let M be the subspace of 7! consisting of all x ¢ ¢' that satisfy the equa- 
tions 


mx(m, 1)= ) x(m,n) (m= 1, 2, 3, ...). 
n=2 

(a) Prove that 7’ = (c,)*. (See also Exercise 24, Chapter 4.) 

(b) Prove that M is a norm-closed subspace of 7’. 

(c) Prove that M is weak*-dense in 7¢’ [relative to the weak*-topology given 
by (a)]. 

(d) Let B be the norm-closed unit ball of 7'. In spite of (c), prove that the 
weak*-closure of M  B contains no ball. Suggestion: If 6 > 0 and m > 2/6, 
then 


6 
|x(m, 1)| eee? 
m 2 


if x € M 7 B, although x(m, 1) = 6 for some x € 6B. Thus 6B is not in the 
weak*-closure of M m B. Extend this to balls with other centers. 

Put x9(m, 1) = m~?, x.(m, n) = 0 when n > 2. Prove that no sequence in M is 
weak*-convergent to X,, in spite of (c). Hint: Weak*-convergence of {x,} to 
Xo implies that x {m, n) — x(m, n) for all m, n, as j > 00, and that {||x,|,} is 
bounded. 

Let X be an infinite-dimensional Fréchet space. Prove that X*, with its weak*- 
topology, is of the first category in itself. 
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Show that the norm-closed unit ball of cg is not weakly compact; recall that 
(co)* = ¢' (Exercise 10). 
Put fy(t) = N~* ¥-N2, e'". Prove that fy > 0 weakly in I?(—2, 2). 
By Theorem 3.13, some sequence of convex combinations of the fy, con- 
verges to 0 in the I?-norm. Find such a sequence. Show that gy = N-‘(f, + °°: 
+ fy) will not do. 
(a) Suppose Q is a locally compact Hausdorff space. For each compact K <Q 
define a seminorm p, on C(Q), the space of all complex continuous functions 
on Q, by 


Px(f) = sup {| f(x)|: x € K}. 


Give C(Q) the topology induced by this collection of seminorms. Prove that 
to every A € C(Q)* correspond a compact K <Q and a complex Borel 
measure p on K such that 


Af= {r due [fe C(Q))). 


(b) Suppose Q is an open set in ¢. Find a countable collection I’ of measures 
with compact support in Q such that H(Q) (the space of all holomorphic 
functions in Q) consists of exactly those f € C(Q) which satisfy | f du = 0 for 
every wel. 

Let X be a topological vector space on which X* separates points. Prove that 

the weak*-topology of X* is metrizable if and only if X has a finite or countable 

Hamel basis. (See Exercise 1, Chapter 2 for the definition.) 

Prove that the closed unit ball of L’ (relative to Lebesgue measure on the unit 

interval) has no extreme points but that every point on the “surface” of the unit 

ball in I? (1 < p < o) is an extreme point of the ball. 

Determine the extreme points of the closed unit ball of C, the space of all con- 

tinuous functions on the unit interval, with the supremum norm. (The answer 

depends on the choice of the scalar field.) 

Let K be the smallest convex set in R* that contains the points (1, 0, 1), (1, 0, 

— 1), and (cos @, sin @, 0), for 0 < @ < 2x. Show that K is compact but that the 

set of all extreme points of K is not compact. Does such an example exist in R?? 

Suppose K is a compact convex set in R". Prove that every x € K 1s a convex 

combination of at most n+ 1 extreme points of K. Suggestion: Use induction 

on n. Draw a line from some extreme point of K through x to where it leaves K. 

Use Exercise 1. 

Let {u,,u,,u3,...} be a sequence of pairwise orthogonal unit vectors in a 

Hilbert space. Let K consist of the vectors 0 and n™‘u, (n > 1). Show that (a) K 

is compact; (b) co(K) is bounded; (c) co(K) is not closed. Find all extreme points 

of co(K). 

If 0 < p <1, every fe I? (except f = 0) is the arithmetic mean of two functions 

whose distance from 0 is less than that of f (See Section 1.47.) Use this to 

construct an explicit example of a countable compact set K in I? (with 0 as its 
only limit point) which has no extreme point. 

If 0< p< 1, show that 7? contains a compact set K whose convex hull is 

unbounded. This happens in spite of the fact that (7?)* separates points on 7°; 
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see Exercise 5. Suggestion: Define x, € 7? by 
x,(n) = n?-}, x,(m) = 0 ifm #n. 
Let K consist of 0, x,, x,,x3,.... If 
Ww =No'(0x, os + Xy), 


show that {y,} is unbounded in /?. 

Suppose yp is a Borel probability measure on a compact Hausdorff space Q, X is 
a Fréchet space, and f: Q > X is continuous. A partition of Q is, by definition, a 
finite collection of disjoint Borel subsets of Q whose union is Q. Prove that to 
every neighborhood V of 0 in X there corresponds a partition {E;} such that the 
difference 


_ | Fie) 
Q i 


lies in V for every choice of s; € E;. (This exhibits the integral as a strong limit 
of “Riemann sums.”) Suggestion: Take V convex and balanced. If A € X* and if 
| Ax| < 1 for every x € V, then | Az| < 1, provided that the sets E; are chosen so 
that f(s) — f(t) € V whenever s and t lie in the same E,. 

In addition to the hypotheses of Theorem 3.27, assume that T is a continuous 
linear mapping of X into a topological vector space Y on which Y* separates 
points, and prove that 


r | fau= | cr du 
Q Q 


Hint: AT € X* for every A € Y*. 

Let E be the set of all extreme points of a compact set K in a topological vector 
space X on which X* separates points. Prove that to every y € K corresponds a 
regular Borel probability measure 1 on Q = E such that 


y= | x d(x). 
Q 


Suppose Q is a region in ¢, X is a Fréchet space, and f: Q > X is holomorphic. 

(a) State and prove a theorem concerning the power series representation of f, 
that is, concerning the formula f(z) = )) (z — a)"c,, where c, € X. 

(b) Generalize Morera’s theorem to X-valued holomorphic functions. 

(c) For a sequence of complex holomorphic functions in Q, uniform con- 
vergence on compact subsets of Q implies that the limit is holomorphic. 
Does this generalize to X-valued holomorphic functions? 

Suppose {a} is a bounded set of distinct complex numbers, f(z) = ){¢ c, 2" is an 

entire function with every c, # 0, and 


gAZ) = f (a; 2). 


Prove that the vector space generated by the functions g; is dense in the Fréchet 
space H(C) defined in Section 1.45. 
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Suggestion: Assume y is a measure with compact support such that 
{ 9; du = 0 for all i. Put 


o(w) = [ys (wz) du(z) (we ). 


Prove that ¢(w) = 0 for all w. Deduce that ) z" du(z) = 0 for n = 1, 2, 3,.... Use 

Exercise 14. 

Describe the closed subspace of H(C) generated by the functions g; if some 

of the c, are 0. 

Suppose X is a Fréchet space (or, more generally, a metrizable locally convex 

space). Prove the following statements: 

(a) X* is the union of countably many weak*-compact sets E,. 

(b) If X is separable, each E, is metrizable. The weak*-topology of X* is there- 
fore separable, and some countable subset of X* separates points on X. 
(Compare with Exercise 15.) 

(c) If K is a weakly compact subset of X and if x, € K is a weak limit point of 
some countable set E c K, then there is a sequence {x,} in E which con- 
verges weakly to x,. Hint: Let Y be the smallest closed subspace of X that 
contains E. Apply (b) to Y to conclude that the weak topology of K 1 Y is 
metrizable. 

Remark: The point of (c) is the existence of convergent subsequences 
rather than subnets. Note that there exist compact Hausdorff spaces in 
which no sequence of distinct points converges. For an example, see Exercise 
18, Chapter 11. 

Let C(K) be the Banach space of all continuous complex functions on the 

compact Hausdorff space K, with the supremum norm. For p e« K, define A, € 

C(K)* by A, f=f(p). Show that p— A, is a homeomorphism of K into C(K)*, 

equipped with its weak*-topology. Part (c) of Exercise 28 can therefore not be 

extended to weak*-compact sets. 

Suppose that p is an extreme point of some convex set K, and that p= 

t,x, +°°: +t,x,, where )' t; = 1, t; > 0 and x; € K for all i. Prove that x; = p 

for all i. 

Suppose that A,,..., A, are convex sets in a vector space X. Prove that every 

x € co(A, U:-: U A,) can be represented in the form 


X=,a, +°°'+1,4,, 


with a; € A; and t; > O for all i, ¥ t; = 1. 
Let X be an infinite-dimensional Banach space and let S = {x € X: ||x|| = 1} be 
the unit sphere of X. We want to cover S with finitely many closed balls, none 
of which contains the origin of X. Can this be done in (a) every X, (b) some X, 
(c)no Xx? 
Let C(J) be the Banach space of all continuous complex functions on the closed 
unit interval J, with the supremum norm. Let M = C(J)*, the space of all 
complex Borel measures on I. Give M the weak*-topology induced by C(J). 

For each t € I, let e, € M be the “evaluation functional” defined by e, f= 
f(t), and define A e M by Af = J f(s) ds. 
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(a) Show that t — e, is a continuous map from J into M and that K = {e,: te I} 
is a compact set in M. 

(b) Show that A é€ ¢co(K). 

(c) Find all p € co(K). 

(d) Let X be the subspace of M consisting of all finite linear combinations 


CoA + C1Q, + + CA, 


with complex coefficients c;. Note that co(K) < X and that X 1 ¢o(K) 1s 
the closed convex hull of K within X. Prove that A is an extreme point of 
X © CO(K), even though A is not in K. 


CHAPTER 


DUALITY IN 
BANACH 
SPACES 


The Normed Dual of a Normed Space 


Introduction If X and Y are topological vector spaces, @(X, Y) will 
denote the collection of all bounded linear mappings (or operators) of X 
into Y. For simplicity, @(X, X) will be abbreviated to A(X). Each A(X, Y) 
is itself a vector space, with respect to the usual definitions of addition and 
scalar multiplication of functions. (This depends only on the vector space 
structure of Y, not on that of X.) In general, there are many ways in which 
Z(X, Y) can be made into a topological vector space. 

In the present chapter, we shall deal only with normed spaces X and 
Y. In that case, @(X, Y) can itself be normed in a very natural way. When 
Y is specialized to be the scalar field, so that @(X, Y) is the dual space X* 
of X, the above-mentioned norm on &(X, Y) defines a topology on X* 
which turns out to be stronger than its weak*-topology. The relations 
between a Banach space X and its normed dual X* form the main topic of 
this chapter. 


4.1 Theorem Suppose X and Y are normed spaces. Associate to each 
A € BX, Y) the number 


(1) Al] = sup {||Ax||: x € X, [lx] < 1}. 
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This definition of ||A|| makes @(X, Y) into a normed space. If Y is a Banach 
space, so is BX, Y). 


PROOF. Since subsets of normed spaces are bounded if and only if they 
lie in some multiple of the unit ball, ||A|| < oo for every A € A(X, Y). 
If wis a scalar, then («A)(x) = a - Ax, so that 


(2) ]aAl] = [a] IAT. 
The triangle inequality in Y shows that 
(A, + Ag)xl] = |]Ayx + Az xl] < ||Ayxl] + A, x|| 
< (I|Aal] + A2I) il] < All + Adal 
for every x € X with ||x|| < 1. Hence 
(3) JA, + Agll < Ay || + TAI. 


If A #0, then Ax # 0 for some x € X; hence ||A|| > 0. Thus @(X, Y) 
is a normed space. 

Assume now that Y is complete and that {A,} is a Cauchy 
sequence in @(X, Y). Since 


(4) A, x — Ag xl < A, — All [ll 


and since it is assumed that ||A, — A,,|| ~0 as n and m tend to o, 
{A,, x} is a Cauchy sequence in Y for every x e X. Hence 

(5) Ax = lim A, x 

exists. It is clear that A: X — Y is linear. If ¢ > 0, the right side of (4) 


does not exceed ¢||x||, provided that m and n are sufficiently large. It 
follows that 


(6) | Ax — A,, xl] < el]l| 


for all large m. Hence ||Ax|| < (||A,,|| + &)||x||, so that A € A(X, Y), 
and ||A — A,,|| < «. Thus A,, > A in the norm of @(X, Y). This estab- 
lishes the completeness of @(X, Y). HH 


4.2 Duality It will be convenient to designate elements of the dual 
space X* of X by x* and to write 


(1) <x, x*) 


in place of x*(x). This notation is well adapted to the symmetry (or duality) 
that exists between the action of X* on X on the one hand and the action 
of X on X* on the other. The following theorem states some basic proper- 
ties of this duality. 
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4.3. Theorem Suppose B is the closed unit ball of a normed space X. 
Define 


\|x*|| = sup {| <x, x*>|: x € B} 


for every x* € X*. 


(a) 
(b) 


(c) 


This norm makes X* into a Banach space. 


Let B* be the closed unit ball of X*. For every x € X, 


xl = sup {| <x, x*) |: x* © BY}. 


Consequently, x* > <x, x*> is a bounded linear functional on X*, of 
norm ||x||. 


B* is weak*-compact. 


PROOF. Since #(X, Y) = X*, when Y is the scalar field, (a) is a corol- 
lary of Theorem 4.1. 

Fix x € X. The corollary to Theorem 3.3 shows that there exists 
y* e B* such that 


(1) <x, y*> = |x|). 
On the other hand, 
(2) | <x, x*>| < ||x|l ]x*]] < [xl 


for every x* € B*. Part (b) follows from (1) and (2). 

Since the open unit ball U of X is dense in B, the definition of 
\|x*|| shows that x* e B* if and only if | <x, x*>| < 1 for every x € U. 
Part (c) now follows directly from Theorem 3.15. //// 


Remark. The weak*-topology of X* is, by definition, the weakest one 
that makes all functionals 


x* + <x, x*» 


continuous. Part (b) shows therefore that the norm topology of X* is 
stronger than its weak*-topology; in fact, it is strictly stronger, unless 
dim X < , since the proposition stated at the end of Section 3.11 
holds for the weak*-topology as well. 

Unless the contrary is explicitly stated, X* will from now on 
denote the normed dual of X (whenever X is normed), and all topo- 
logical concepts relating to X* will refer to its norm topology. This 
implies in no way that the weak*-topology will not play an important 
role. 


We now give an alternative description of the operator norm defined 


in Theorem 4.1. 
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4.4 Theorem If X and Y are normed spaces and if A € BX, Y), then 
Al] = sup {| <Ax, y*>|: |[xl] < 1, lly*l] < 1p. 


PROOF. Apply (b) of Theorem 4.3 with Y in place of X. This gives 
| Ax|| = sup {|<Ax, y*>|: ly*ll < 1} 


for every x € X. To complete the proof, recall that 


Al] = sup {|| Ax]: [xl] < 1}. //// 


4.5 The second dual of a Banach space The normed dual X* of a 
Banach space X is itself a Banach space and hence has a normed dual of its 
own, denoted by X**. Statement (b) of Theorem 4.3 shows that every x ¢ X 
defines a unique dx € X**, by the equation 


(1) <x, X¥> = Kx*, OXY (x* € X*), 
and that 
(2) | px|| = |x| (x € X). 


It follows from (1) that @: X — X** is linear; by (2), @ is an isometry. Since 
X is now assumed to be complete, ¢(X) is closed in X**. 

Thus ¢ is an isometric isomorphism of X onto a closed subspace of X**. 

Frequently, X is identified with ¢(X); then X is regarded as a sub- 
space of X**. 

The members of @(X) are exactly those linear functionals on X* that 
are continuous relative to its weak*-topology. (See Section 3.14.) Since the 
norm topology of X* is stronger, it may happen that ¢(X) is a proper 
subspace of X**. But there are many important spaces X (for example, all 
[P-spaces with 1 < p < o) for which $(X) = X**; these are called reflexive. 
Some of their properties are given in Exercise 1. 

It should be stressed that, in order for X to be reflexive, the existence 
of some isometric isomorphism ¢@ of X onto X** is not enough; it 1s crucial 
that the identity (1) be satisfied by @. 


4.6 Annihilators Suppose X is a Banach space, M is a subspace of X, 
and N is a subspace of X*; neither M nor N is assumed to be closed. Their 
annihilators M+ and “N are defined as follows: 


M* = {x* € X*: <x, x*> = 0 for all x € M}, 
*N = {x € X: <x, x*> = 0 for all x* € N}. 


Thus M+ consists of all bounded linear functionals on X that vanish 
on M, and “N is the subset of X on which every member of N vanishes. It 
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is clear that M+ and +N are vector spaces. Since M~ is the intersection of 
the null spaces of the functionals @x, where x ranges over M (see Section 
4.5), Mt is a weak*-closed subspace of X*. The proof that +N is a norm- 
closed subspace of X is even more direct. The following theorem describes 
the duality between these two types of annihilators. 


4.7 Theorem Under the preceding hypotheses, 


(a) *(M°) is the norm-closure of M in X, and 
(b) (“N)* is the weak*-closure of N in X*. 


As regards (a), recall that the norm-closure of M equals its weak 
closure, by Theorem 3.12. 


PROOF. If x eM, then <x, x*)=0 for every x* e M-, so that 
x € '(M*). Since (M+) is norm-closed, it contains the norm-closure 
M of M. On the other hand, if x ¢ M the Hahn-Banach theorem 
yields an x* € M+ such that <x, x*> #0. Thus x ¢ 1(M“), and (a) is 
proved. 

Similarly, if x* e N, then <x, x*> =0 for every x € tN, so that 
x* €(*N)". This weak*-closed subspace of X* contains the weak*- 
closure N of N. If x* ¢ N, the Hahn-Banach theorem (applied to the 
locally convex space X* with its weak*-topology) implies the exis- 
tence of an x € +N such that <x, x*> 40; thus x* ¢ ((N)~, which 
proves (5). [// 


Observe, as a corollary, that every norm-closed subspace of X is the 
annihilator of its annihilator and that the same is true of every weak*- 
closed subspace of X*. 


4.8 Duals of subspaces and of quotient spaces If M is a closed sub- 
space of a Banach space X, then X/M is also a Banach space, with respect 
to the quotient norm. This was defined in the proof of (d) of Theorem 1.41. 
The duals of M and of X/M can be described with the aid of the annihilator 
M~ of M. Somewhat imprecisely, the result is that 


M*=X*/M+ = and-—s (X/M)* = Mt. 


This is imprecise because the equalities should be replaced by isometric 
isomorphisms. The following theorem describes these explicitly. 
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4.9 Theorem Let M be aclosed subspace of a Banach space X. 


(a) The Hahn-Banach theorem extends each m* € M* to a functional 
x* € X*. Define 
om* = x* + M-. 
Then a is an isometric isomorphism of M* onto X*/M*. 
(b) Let x: X + X/M be the quotient map. Put Y = X/M. For each y* € Y*, 
define 


CS 


Then t is an isometric isomorphism of Y* onto M~. 


PROOF. (a) If x* and x* are extensions of m*, then x* — x* is in M*; 
hence x* + M* = x* + M-. Thus a is well defined. A trivial verifica- 
tion shows that o is linear. Since the restriction of every x* e¢ X* to M 
is a member of M*, the range of a is all of X*/M~. 

Fix m* e M*. If x* e X* extends m*, it is obvious that 
\|m*|| < ||x*||. The greatest lower bound of the numbers ||x*|| so 
obtained is ||x* + M*||, by the definition of the quotient norm. Hence 


|m*|| < llom*|| < ||x* I. 


But Theorem 3.3 furnishes an extension x* of m* with ||x*|| = ||m*||. 
It follows that ||om*|| = ||m*||. This completes (a). 

(b) If x e X and y* e Y*, then zx e€ Y; hence x > y*zx is a con- 
tinuous linear functional on X which vanishes for x € M. Thus 
ty* € M-. The linearity of t is obvious. Fix x* ¢ M+. Let N be the 
null space of x*. Since M CN, there is a linear functional A on Y 
such that Ax = x*. The null space of A is 2(N), a closed subspace of 
Y, by the definition of the quotient topology in Y = X/M. By 
Theorem 1.18, A is continuous, that is, A e Y*. Hence tA = Az = x*. 
The range of t is therefore all of M-. 

It remains to be shown that t is an isometry. 

Let B be the open unit ball in X. Then 7B is the open unit ball 
of Y = 2X. Since ty* = y*z, we have 


lIty*|| = lly*z|| = sup {| <2x, y*>|: x € B} 
= sup {|<y, y*>|: y € mB} = |ly*| 
for every y* € Y*. [/// 
Adjoints 


We shall now associate with each T € @(X, Y) its adjoint, an operator 
T* € B(Y*, X*), and will see how certain properties of T are reflected in 
the behavior of T*. If X and Y are finite-dimensional, every T € A(X, Y) 
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can be represented by a matrix [T]; in that case, [T*] is the transpose of 
[T], provided that the various vector space bases are properly chosen. No 
particular attention will be paid to the finite-dimensional case in what 
follows, but historically linear algebra did provide the background and 
much of the motivation that went into the construction of what is now 
known as operator theory. 

Many of the nontrivial properties of adjoints depend on the complete- 
ness of X and Y (the open mapping theorem will play an important role). 
For this reason, it will be assumed throughout that X and Y are Banach 
spaces, except in Theorem 4.10, which furnishes the definition of T*. 


4.10 Theorem Suppose X and Y are normed spaces. To each 
T € BX, Y) corresponds a unique T* € B(Y*, X*) that satisfies 


(1) (Tx, y*> = <x, T*y*) 
for all x € X and all y* € Y*. Moreover, T* satisfies 
(2) [7] = TIL. 


PROOF. If y* € Y* and T € &(X, Y), define 
(3) T*y* = y* o T. 


Being the composition of two continuous linear mappings, 
T*y* e€ X*. Also, 


(x, T*y*> = (T*y*\(x) = y*(Tx) = (Tx, y*, 


which is (1). The fact that (1) holds for every x € X obviously deter- 
mines T*y* uniquely. 
If yf e Y* and y% e€ Y*, then 


<x, T™(yt + y2)> = (Tx, yf + y3> 
= (Tx, yi> + (Tx, yz) 
= «x, T*yf> + <x, T*yz> 
= «x, T*yf + T*yz> 
for every x € X, so that 
(4) T*(yt + yz) = T*yt + T*y3. 


Similarly, T*(ay*) = aT*y*. Thus T*: Y* > X* is linear. Finally, (b) 
of Theorem 4.3 leads to 


|T|| = sup {|< Tx, y*>|: Ix] <1, ly*l < Y 
= sup {| <x, T*y*>|: Ix <1 lly*l < } 
= sup {|| T*y*||: lly*l] < 1} = | T* I. //// 
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4.11 Notation If T maps X into Y, the null space and the range of T 
will be denoted by .(T) and A(T), respectively: 
N(T) = {xe X: Tx = 0}, 
AT) = {ye Y: Tx = y for some x € X}. 


The next theorem concerns annihilators; see Section 4.6 for the notation. 
4.12 Theorem Suppose X and Y are Banach spaces, and T € @&X, Y). 
Then 


N(T*)=AT): and WN(T)=*A(T*). 


PROOF. In each of the following two columns, each statement is obvi- 
ously equivalent to the one that immediately follows and/or precedes 


it. 

y* € V(T*). x € V(T). 

ry” =, Tx = 0. 

<x, T*y*») = 0 for all x. <Tx, y*> = 0 for all y*. 

«Tx, y*> = 0 for all x. <x, T*y*> = 0 for all y*. 

y* € AT). x € *RT*). //// 
Corollaries 


(a) W(T*) is weak*-closed in Y*. 
(b) &(T) is dense in Y if and only if T* is one-to-one. 
(c) T is one-to-one if and only if A(T *) is weak*-dense in X*. 


Recall that M+ is weak*-closed in Y* for every subspace M of 
Y. In particular, this is true of @(T)*. Thus (a) follows from the 
theorem. 

As to (b), A(T) is dense in Y if and only if A(T)~ = {0}; in that 
case, V(T*) = {0}. 

Likewise, *(T*) = {0} if and only if A(T*) is annihilated by no 
x € X other than x = 0; this says that @(T*) is weak*-dense in X*. 

Note that the Hahn-Banach theorem 3.5 was tacitly used in the 
proofs of (b) and (c). 

There is a useful analogue of (b), namely, that A(T) is all of Y if 
and only if T* is one-to-one and its inverse [mapping &@(T*) onto Y*] 
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is bounded. The equivalence of (a) and (d) in the following theorem 
expresses this in slightly different terms. Theorem 4.15 is closely 
related. The union of the following three theorems is sometimes called 
the closed range theorem. 


4.13 Theorem Let U and V be the open unit balls in the Banach spaces 
X and Y, respectively. If T € B(X, Y) and 6 > 0, then the implications 
(a) > (b) > (c) > (@) 


hold among the following statements: 


(a) ||T*y*|| = Olly*|| for every y* € Y*. 
(b) T(U) > 6V. 

(c) T(U) > 6V. 

(qd) T(X)=Y. 


Moreover, if (d) holds, then (a) holds for some 6 > 0. 


PROOF. Assume (a), and pick y, ¢ T(U). Since T(U) is convex, closed, 
and balanced, Theorem 3.7 shows that there is a y* such that 
l<y, y*>| <1 for every ye T(U), but |<yo, y*>| > 1. If x € U, it 
follows that 


| <x, T*y*>| =|<Tx, y*>| < 1. 
Thus || T*y*|| < 1, and now (a) gives 
0 <0|<Vo. Y*P| < Ollyoll Ily*ll S lyoll IT *Y* ll < Iyoll- 


It follows that y e T(U) if ||y|| < 6. Thus (a) > (0). 
____ Next, assume (b). Take 6 = 1, without loss of generality. Then 
T(U) > V. To every y € Y and every ¢ > 0 corresponds therefore an 
x € X with ||x|| < ||y|| and |ly — Tx] <e. 

Pick y, € V. Pick ¢, > Oso that 


[0 @) 


dn <1 — Will 


n=1 


Assume n > 1 and y, is picked. There exists x, such that ||x,|| < |ly,,| 
and |ly, — Tx,|| < ¢,. Put 


Yn+1 = Yn — TX,,. 


By induction, this process defines two sequences {x,,} and {y,}. Note 
that 


IIXn+1 ll < IVn+1ll as ln 7 Tx, < En 
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Hence 


ce @) 


90 00 
xa < Hall + Ye < Iv + De <1. 
1 


n= n=1 n=1 


It follows that x = )° x, is in U (see Exercise 23) and that 


N N 
Tx = lim )) Tx, = lim Y (yn — ne) = V1 


No n=1 Noo n=1 


since yy, >0as N > oo. Thus y, = Tx € T(U), which proves (c). 

Note that the preceding argument is just a specialized version of 
part of the proof of the open mapping theorem 2.11. 

That (c) implies (d) is obvious. 

Assume (d). By the open mapping theorem, there is a 6 > 0 such 
that T(U) > dV. Hence 


|| T*y* || = sup {|<x, T*y*>|: x e U} 
= sup {|<Tx, y*>|: x e U} 


> sup {|<y, y*>|: ye OV} = dlly*| 
for every y* e€ Y*. This is (a). //// 


4.14 Theorem If X and Y are Banach spaces and if T € B(X, Y), then 
each of the following three conditions implies the other two: 


(a) S&T) is closed in Y. 
(b) &(T*) is weak*-closed in X*. 
(c) A&(T*) is norm-closed in X*. 


Remark. Theorem 3.12 implies that (a) holds if and only if A(T) is 
weakly closed. However, norm-closed subspaces of X* are not always 
weak*-closed (Exercise 7, Chapter 3). 


PROOF. It is obvious that (b) implies (c). We will prove that (a) implies 
(b) and that (c) implies (a). 

Suppose (a) holds. By Theorem 4.12 and (b) of Theorem 4.7, 
N(T)~ is the weak*-closure of &(T*). To prove (b) it is therefore 
enough to show that .W(T)~ < &A(T*). 

Pick x* € W(T)*. Define a linear functional A on &(T) by 


NESS XxX (x € X). 


Note that A is well defined, for if Tx = Tx’, then x — x’ € W(T); 
hence 


«x — x’, x*>) =0. 
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The open mapping theorem applies to 
T: X ~ A(T) 


since @(T) is assumed to be a closed subspace of the complete space Y 
and is therefore complete. It follows that there exists K < oo such that 
to each y € €(T) corresponds an x € X with Tx = y, |x| < Kllyll, 
and 


|Ay| = |ATx| =| <x, x*>| < Kllyll |x". 


Thus A is continuous. By the Hahn-Banach theorem, some y* € Y* 
extends A. Hence 


CPx yy SAT XS KX xX (x € X). 


This implies x* = T*y*. Since x* was an arbitrary element of (T)~, 
we have shown that W(T)* < &(T*). Thus (b) follows from (a). 

Suppose next that (c) holds. Let Z be the closure of A(T) in Y. 
Define S € BX, Z) by setting Sx = Tx. Since A(S) is dense in Z, 
Corollary (b) to Theorem 4.12 implies that 


S*: Z* — X* 


is One-to-one. 
If z* e€ Z*, the Hahn-Banach theorem furnishes an extension y* 
of z*; for every x € X, 


«x, T*y*) = Tx, y*) = (Sx, 2*> = <x, S*2*). 


Hence S*z* = T*y*. It follows that S* and T* have identical ranges. 
Since (c) is assumed to hold, A@(S*) is closed, hence complete. 
Apply the open mapping theorem to 


S*: Z* + AS*). 


Since S* is one-to-one, the conclusion is that there is a constant c > 0 
which satisfies 


c||z*|| < |S*z*| 


for every z* € Z*. Hence S: X > Z is an open mapping, by Theorem 
4.13. In particular, S(X) = Z. But A(T) = AS), by the definition of S. 
Thus &€(T) = Z, a closed subspace of Y. 

This completes the proof that (c) implies (a). /f// 


The following consequence is useful in applications. 


Theorem Suppose X and Y are Banach spaces, and T € &(X, Y). 
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(a) A(T)= Y if and only if 
(b) T* is one-to-one and &(T*) is norm-closed. 


PROOF. If (a) holds then T* is one-to-one, by Theorem 4.12. The 
implication (d) > (a) of Theorem 4.13 shows that T* is (a multiple of) 
a dilation; hence &(T*) is closed, by Theorem 1.26. 

If (b) holds, then @(T) is dense in Y, again by Theorem 4.12, and 
A(T) is closed by Theorem 4.14. 


Compact Operators 


4.16 Definition Suppose X and Y are Banach spaces and U is the open 
unit ball in X. A linear map T: X — Y is said to be compact if the closure of 
T(U) is compact in Y. It is clear that T is then bounded. Thus T € A(X, Y). 


Since Y is a complete metric space, the subsets of Y whose closure is 
compact are precisely the totally bounded ones. Thus T € &(X, Y) 1s 
compact if and only if T(U) is totally bounded. Also, T is compact if and 
only if every bounded sequence {x,} in X contains a subsequence {x,,} such 
that {Tx,,} converges to a point of Y. 

Many of the operators that arise in the study of integral equations are 
compact. This accounts for their importance from the standpoint of appli- 
cations. They are in some respects as similar to linear operators on finite- 
dimensional spaces as one has any right to expect from operators on 
infinite-dimensional spaces. As we shall see, these similarities show up par- 
ticularly strongly in their spectral properties. 


4.17 Definitions (a) Suppose X is a Banach space. Then &@(X) [which is 
an abbreviation for @(X, Y)] is not merely a Banach space (see Theorem 
4.1) but also an algebra: If S € A(X) and T € &(X), one defines ST € A(X) 
by 


(ST)\(x) = S(T(x)) (x € X). 
The inequality 
IST || < [S| TI 


is trivial to verify. 

In particular, powers of T € @(X) can be defined: T° = I, the identity 
mapping on X, given by Ix = x, and T" = TT" ', forn = 1, 2, 3,.... 

(b) An operator T € &(X) is said to be invertible if there exists 
S € BX) such that 
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In this case, we write S = T~'. By the open mapping theorem, this happens 
if and only if W(T) = {0} and A(T) = X. 

(c) The spectrum o(T) of an operator T € &(X) is the set of all scalars 
A such that T — AI is not invertible. Thus 4 € o(T) if and only if at least one 
of the following two statements is true: 


(i) The range of T — AI is not all of X. 
(ii) T — Al is not one-to-one. 


If (ii) holds, A is said to be an eigenvalue of T; the corresponding 
eigenspace is WV(T — AI); each x € W(T — Al) (except x = 0) is an eigen- 
vector of T; it satisfies the equation 


Tx = Ax. 


Here are some very easy facts which will illustrate these concepts. 


418 Theorem Let X and Y be Banach spaces. 


(a) If T € BX, Y) and dim &(T) < o, then T is compact. 

(b) If T € BX, Y), T is compact, and &(T) is closed, then dim &(T) < o. 

(c) The compact operators form a closed subspace of B(X, Y) in its norm- 
topology. 

(d) If T € &X), T is compact, and 1 # 0, then dim V(T — Al) < o. 

(e) Ifdim X = wo, T € &X), and T is compact, then 0 € o(T). 

(f) IfS € BX), T € BX), and T is compact, so are ST and TS. 


PROOF. Statement (a) is obvious. If A(T) is closed, then @(T) is com- 
plete (since Y is complete), so that T is an open mapping of X onto 
A(T); if T is compact, it follows that &(T) is locally compact; thus (b) 
is a consequence of Theorem 1.22. 

Put Y = V(T — AI) in (d). The restriction of T to Y is a 
compact operator whose range is Y. Thus (d) follows from (b), and so 
does (e), for if 0 is not in o(T), then @(T) = X. The proof of (f) is 
trivial. 

If S and T are compact operators from X into Y, so is S + T, 
because the sum of any two compact subsets of Y is compact. It 
follows that the compact operators form a subspace 2 of @(X, Y). 
To complete the proof of (c), we now show that ~ is closed. Let T € 
BX, Y) be the closure of £, choose r > O, and let U be the open unit 
ball in X. There exists S € & with ||S — T|| <r. Since S(U) is totally 
bounded, there are points x,, ..., x, in U such that S(U) is covered 
by the balls of radius r with centers at the points Sx;. Since 
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||Sx — Tx|| <r for every x € U, it follows that T(U) is covered by the 
balls of radius 3r with centers at the points Tx,;. Thus T(U) is totally 
bounded, which proves that T é€ X. //// 


The main objective of the rest of this chapter is to analyze the spec- 
trum of a compact T € &@(X). Theorem 4.25 contains the principal results. 
Adjoints will play an important role in this investigation. 


4.19 Theorem Suppose X and Y are Banach spaces and T € &X, Y). 
Then T is compact if and only if T* is compact. 


PROOF. Suppose T is compact. Let {y*} be a sequence in the unit ball 
of Y*. Define 


Sily) = <y, yt (ye Y). 


Since | f(y) —fA(v) | < lly — y'll, {f,} is equicontinuous. Since T(U) 
has compact closure in Y (as before, U is the unit ball of X), Ascoli’s 
theorem implies that {f,} has a subsequence { f,,} that converges uni- 
formly on T(U). Since 


|T*ys — T*ys|| = sup |< Tx, yt — y*)| 
= sup [f,(Tx) —f,(T9)|, 

the supremum being taken over x € U, the completeness of X* 
implies that {T*y*} converges. Hence T* is compact. 

The second half can be proved by the same method, but it may 
be more instructive to deduce it from the first half. 

Let @: X > X** and Ww: Y > Y** be the isometric embeddings 
given by the formulas 

(x, XD = Kx, @x> and — Ky, y*> = Ky", WY), 
as in Section 4.5. Then 
<y*, WTx> = CTX, y*> = <x, T*y*) = CT *y*, Ox) = <y*, T**Ox) 
for all x e X and y* € Y*, so that 
WT = T**¢. 
If x e U, then x lies in the unit ball U** of X**. Thus 
WT(U) < T**(U**). 


Now assume that T* is compact. The first half of the theorem 
shows that T**: X** + Y** is compact. Hence T**(U**) is totally 
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bounded, and so is its subset WT(U). Since w is an isometry, T(U) is 
also totally bounded. Hence T is compact. //// 


4.20 Definition Suppose M is a closed subspace of a topological vector 
space X. If there exists a closed subspace N of X such that 


X=M+N and MoN= {0}, 


then M is said to be complemented in X. In this case, X is said to be the 
direct sum of M and N, and the notation 


X=MON 


is Sometimes used. 


We shall see examples of uncomplemented subspaces in Chapter 5. At 
present we need only the following simple facts. 


4.21 Lemma Let M be aclosed subspace of a topological vector space X. 


(a) If X is locally convex and dim M < oo, then M is complemented in X. 
(b) If dim (X/M) < o, then M is complemented in X. 


The dimension of X/M is also called the codimension of M in X. 


PROOF. (a) Let {e;, ..., €,} be a basis for M. Every x € M has then a 
unique representation 
xX = a,(x)ey +°°' + 4,(x)e,. 


Each «; is a continuous linear functional on M (Theorem 1.21 and 
Lemma 1.20) which extends to a member of X*, by the Hahn-Banach 
theorem. Let N be the intersection of the null spaces of these exten- 
sions. Then X = M@N. 


(b) Let x: X + X/M be the quotient map, let {e,, ..., e,} be a 
basis for X/M, pick x; € X so that 2x; =e; (1 <i<_n), and let N be 
the vector space spanned by {x,,..., x,}. Then X = M@N. ji} 


4.22 Lemma _ [If M is a subspace of a normed space X, if M is not dense 
in X, and ifr > 1, then there exists x € X such that 


Ix|| <r but Ix — y|| > 1 for allye M. 


PROOF. There exists x, € X whose distance from M is 1, that is, 
inf {||x, — yl]: ye M} =1. 
Choose y, € M such that ||x, — y,|| <r, and put x = x, — yj. /T// 
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4.23 Theorem If X is a Banach space, T € &(X), T is compact, and 
A #0, then T — AI has closed range. 


PROOF. By (d) of Theorem 4.18, dim W(T — AI) < o. By (a) of 
Lemma 4.21, X is the direct sum of “W(T — AI) and a closed subspace 
M. Define an operator S € &(M, X) by 


(1) Sx = Tx — Ax. 


Then S is one-to-one on M. Also, &(S) = &(T — Al). To show that 
A(S) is closed, it suffices to show the existence of an r > 0 such that 


(2) r||x|| < ||Sx|| for allx Ee M, 


by Theorem 1.26. 

If (2) fails for every r>0, there exists {x,} in M such that 
|x, | = 1, Sx, +0, and (after passage to a subsequence) Tx, — Xo for 
some X, € X. (This is where compactness of T is used.) It follows that 
AX, > Xo. Thus x, € M, and 


SX>o = lim (ASx,) = 0. 


Since S is one-to-one, xX) =0. But |x,|| =1 for all n, and x = 
lim Ax,, and so ||x9|| =|A|>0. This contradiction proves (2) for 
some r > 0. [I// 


4.24 Theorem Suppose X is a Banach space, T € &(X), T is compact, 
r > 0, and E is a set of eigenvalues A of T such that |4| > r. Then 


(a) foreachA ec E, A(T — Al ¥ X, and 
(b) Eis a finite set. 


PROOF. We shall first show that if either (a) or (b) is false then there 
exist closed subspaces M,, of X and scalars A, € E such that 


(1) M,<M,¢M,c::-, M, # My +1, 
(2) T(M,) < M, for n> 1, 

and 

(3) (T —4,1D(M,) <M, -, for n > 2. 


The proof will be completed by showing that this contradicts the 
compactness of T. 
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Suppose (a) is false. Then &(T — A, 1) = X for some Ay € E. Put 
S = T — A,I, and define M, to be the null space of S". (See Section 
4.17.) Since A, is an eigenvalue of 7, there exists x, € M,, x, 40. 
Since &@(S) = X, there is a sequence {x,} in X such that Sx,,, = x,, 
n= 1,2, 3,.... Then 


(4) SX by = X41 f 0 but Se ey = SX, = 0. 


Hence M, is a proper closed subspace of M,,,. It follows that (1) to 
(3) hold, with 4, = 4,. [Note that (2) holds because ST = TS.] 
Suppose (b) is false. Then E contains a sequence {/,,} of distinct 
eigenvalues of T. Choose corresponding eigenvectors e,, and let M,, 
be the (finite-dimensional, hence closed) subspace of X spanned by 


{e,,..., &,}. Since the 4, are distinct, {e,,..., e,} is a linearly indepen- 
dent set, so that M,_, is a proper subspace of M,. This gives (1). If 
x € M,, then 


X= 4,e; +:°: Oe. 
which shows that Tx € M, and 
(T — A, Dx = ay(Ay — Aner, +075 +, -1(An-1 — Anlen-1 © Ma-1. 


Thus (2) and (3) hold. 
Once we have closed subspaces M,, satisfying (1) to (3), Lemma 
4.22 gives us vectors y, € M,, for n = 2, 3, 4,..., such that 


(5) Ilys 2 and fly,—xl 21 if xeM,-,. 
If 2 << m <n, define 

(6) Z= TYm —(T — An DYn- 

By (2) and (3), z € M,,_,. Hence (5) shows that 
IT Yn — TYmll = Ann — ZI) = lal Wn — An Zl) SAW | > 3 


The sequence {Ty,} has therefore no convergent subsequences, 
although {y,,} is bounded. This is impossible if T is compact. //// 


4.25 Theorem Suppose X is a Banach space, T € &(X), and T is 
compact. 


(a) Ifa #0, then the four numbers 
a=dim V(T — dl) 
B=dim X/A&(T — Al) 
a* = dim V(T* — Al) 
B* = dim X*/A(T* — AD) 


(b) 
(c) 
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are equal and finite. 
If 2 4 Oand A € o(T) then A is an eigenvalue of T and of T*. 


o(T) is compact, at most countable, and has at most one limit point, 
namely, 0. 


Note: The dimension of a vector space is here understood to be either 


a nonnegative integer or the symbol oo. The letter J is used for the identity 
operators on both X and X*; thus 


(T —AD* = T* — AI* = T* — AI, 


since the adjoint of the identity on X 1s the identity on X™*. 


The spectrum o(T) of T was defined in Section 4.17. Theorem 4.24 


contains a special case of (a): B = 0 implies a = 0. This will be used in the 
proof of the inequality (4) below. 


It should be noted that o(T) is compact even if T is not (Theorem 


10.13). The compactness of T is needed for the other assertions in (c). 


PROOF. Put S = T — JI, to simplify the writing. 

We begin with an elementary observation about quotient spaces. 
Suppose M,, is a closed subspace of a locally convex space Y, and k is 
a positive integer such that k < dim Y/M,. Then there are vectors 
Yi, ---» y, in Y such that the vector space M; generated by M, and 
Yi. +--+, y; contains M;_, as a proper subspace. By Theorem 1.42, each 
M, is closed. By Theorem 3.5, there are continuous linear functionals 
A,, ..-, A, on Y such that A;y;=1 but A;y =0 for all ye M,_,. 
These functionals are linearly independent. The following conclusion 
is therefore reached: If & denotes the space of all continuous linear 
functionals on Y that annihilate My, then 


(1) dim Y/M, < dim X. 


Apply this with Y = X, M, = &(S). By Theorem 4.23, &(S) is 
closed. Also, £ = A@(S)- = V(S*), by Theorem 4.12, so that (1) 
becomes 


(2) B<a*. 


Next, take Y = X* with its weak*-topology; take My = &(S*). 
By Theorem 4.14, @(S*) is weak*-closed. Since X now consists of all 
weak*-continuous linear functionals on X* that annihilate @(S*), & is 
isomorphic to *“&(S*) = W(S) (Theorem 4.12), and (1) becomes 


(3) B* <a. 
Our next objective is to prove that 


(4) a<£p. 


110 


PART I: GENERAL THEORY 


Once we have (4), the inequality 
(5) a* < p* 


is also true, since T* is a compact operator (Theorem 4.19). Since 
a < o by (d) of Theorem 4.18, (a) is an obvious consequence of the 
inequalities (2) to (5). 

Assume that (4) is false. Then « > £. Since a < oo, Lemma 4.21 
shows that X contains closed subspaces E and F such that dim F = B 
and 


(6) X =N(S)\OE=AS) OF. 


Every x € X has a unique representation x =x, +x,, with x,¢é 
NS), x, € E. Define x: X — W(S) by setting mx = x,. It is easy to see 
(by the closed graph theorem, for instance) that z is continuous. 

Since we assume that dim /(S)> dim F, there is a linear 
mapping ¢@ of (S) onto F such that @x, = 0 for some x, # 0. Define 


(7) Ox = Tx + bux (x € X). 


Then Me A(X). Since dim &(d) < «, gx is a compact operator; 
hence so is ® (Theorem 4.18). 
Observe that 


(8) ® — AI =S + on. 

If x € E, then xx = 0, (® — ADx = Sx; hence 

(9) (® — AINE) = AS). 

If x € W(S), then mx = x, 

(10) (D — AI)x = ox, 

and therefore 

(11) (D — AI(N(S)) = G(M(S)) = F. 
It follows from (9) and (11) that 

(12) A® —AIN> A(S)+ F=X. 


But if (10) is used with x = x,, we see that / is an eigenvalue of 
®, and since ® is compact, Theorem 4.24 shows that the range of 
® — AI cannot be all of X. This contradicts (12); hence (4) is true and 
(a) is proved. 

Part (b) follows from (a), for if 2 is not an eigenvalue of T, then 
a(T) = 0, and (a) implies that B(T) = 0, that is, that A(T — AN) = X. 
Thus T — AI is invertible, so that 1 ¢ o(T)). 

It now follows from (b) of Theorem 4.24 that 0 is the only pos- 
sible limit point of o(T), that o(T) is at most countable, and that 
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o(T) u {0} is compact. If dim X < oo, then o(T) is finite; if 
dim X = o, then 0€ o(T), by (e) of Theorem 4.18. Thus o(T) is 
compact. This gives (c) and completes the proof of the theorem. //// 


Exercises 


Throughout this set of exercises, X and Y denote Banach spaces, unless the contrary 
is explicitly stated. 


1. 


Let @ be the embedding of X into X** described in Section 4.5. Let t be the 

weak topology of X, and let o be the weak*-topology of X¥**—the one induced 

by X*. 

(a) Prove that @ is a homeomorphism of (X, t) onto a dense subspace of 
(X**, o). 

(b) If B is the closed unit ball of X, prove that ¢(B) is o-dense in the closed unit 
ball of X**. (Use the Hahn-Banach separation theorem.) 

(c) Use (a), (b), and the Banach-Alaoglu theorem to prove that X is reflexive if 
and only if B is weakly compact. 

(d) Deduce from (c) that every norm-closed subspace of a reflexive space X is 
reflexive. 

(e) If X is reflexive and Y is a closed subspace of X, prove that X/Y is reflexive. 

(f) Prove that X is reflexive if and only if X* is reflexive. 

Suggestion: One half follows from (c); for the other half, apply (d) to the sub- 

space @(X) of X**. 


. Which of the spaces cy, ¢', ¢?, @~ are reflexive? Prove that every finite- 


dimensional normed space is reflexive. Prove that C, the supremum-normed 
space of all complex continuous functions, on the unit interval, is not reflexive. 


. Prove that a subset E of @(X, Y) is equicontinuous if and only if there exists 


M < o such that ||A|| < M for every A € E. 


. Recall that X¥* = @(X, (), if C is the scalar field. Hence A* € A(C, X*) for every 


A e€ X*. Identify the range of A*. 


. Prove that T € @(X, Y) is an isometry of X onto Y if and only if T* is an 


isometry of Y* onto X*. 


. Let o and t be the weak*-topologies of X* and Y*, respectively, and prove that 


S is a continuous linear mapping of (Y*, t) into (X*, oa) if and only if S = T* for 
some T € A(X, Y). 


. Let L’ be the usual space of integrable functions on the closed unit interval J, 


relative to Lebesgue measure. Suppose T € AL’, Y), so that T* € B(Y*, L*). 
Suppose &@(T*) contains every continuous function on J. What can you deduce 
about T? 


. Prove that (ST)* = T*S*. Supply the hypotheses under which this makes sense. 
. Suppose S € BX), T € A(X). 


(a) Show, by an example, that ST = I does not imply TS = I. 
(b) However, assume T is compact, show that 


SU — T) = 1 if and only if 7 — T)S = J, 


and show that either of these equalities implies that J —(I — T)~'* is 
compact. 
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10. Assume T € &(X) is compact, and assume either that dim X = o or that the 
scalar field is ¢. Prove that o(T) is not empty. However, o(T) may be empty if 
dim X < oo and the scalar field is R. 

11. Suppose dim X < oo and show that the equality B* =f of Theorem 4.25 
reduces to the statement that the row rank of a square matrix is equal to its 
column rank. 

12. Suppose T € AX, Y) and A(T) is closed in Y. Prove that 


dim V(T) = dim X*/A(T*), 
dim W(T*) = dim Y/&(T). 


This generalizes the assertions « = B* and a* = B of Theorem 4.25. 
13. (a) Suppose T € A(X, Y), T, € AX, Y) for n = 1, 2, 3, ..., each T,, has finite- 
dimensional range, and lim ||T — T,|| = 0. Prove that T is compact. 
(b) Assume Y is a Hilbert space, and prove the converse of (a): Every compact 
T € @X, Y) can be approximated in the operator norm by operators with 
finite-dimensional ranges. Hint: In a Hilbert space there are linear projec- 
tions of norm 1 onto any closed subspace. (See Theorems 5.16, 12.4.) 
14. Define a shift operator S and a multiplication operator M on ¢? by 


0 ifn =0 
Sx\(n) = 
ay) aa 1) ifn>1, 


(Mx\n)=(n+1)7'x(n) ifn >0. 


Put T = MS. Show that T is a compact operator which has no eigenvalue and 
whose spectrum consists of exactly one point. Compute ||T"||, for n = 1, 2, 3,..., 
and compute lim,.,,, ||T"||*”. 

15. Suppose yp is a finite (or o-finite) positive measure on a measure space Q, yp x yu 
is the corresponding product measure on Q x Q, and K € (yu x yp). Define 


(TA Xs) = | Ke f(t) du(t) fe Lw]. 
(a) Prove that T € A(L?(u)) and that 


|T ||? < {| | K(s, t)|? du(s) du(t). 


(b) Suppose a;, b; are members of L(y), for 1 <i <n, put K,(s, t) = >> afs)b{{0), 
and define T, in terms of K, as T was defined in terms of K. Prove that 
dim &(T,) <n. 

(c) Deduce that T is a compact operator on I?(y). Hint: Use Exercise 13. 

(d) Suppose A € ¢, A # 0. Prove: Either the equation 


Tf—Af=g 


has a unique solution fe [7(u) for every g € L(y) or there are infinitely 
many solutions for some g and none for others. (This is known as the Fred- 
holm alternative.) 

(e) Describe the adjoint of T. 


16. 


17. 


18. 


19. 


20. 


21. 
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Define 


(1 —s)t fOc<ct<s 
(1 — t)s fs<t<1 


K(s, t) = } 


and define T ¢ A(L7(0, 1)) by 


1 
(TfXs) = | K(s, t) f(t) dt (O<s< 1). 
0 

(a) Show that the eigenvalues of T are (nn)~*, n= 1, 2, 3, ..., that the corre- 
sponding eigenfunctions are sin nmx, and that each eigenspace is one- 
dimensional. Hint: If 4 #0, the equation Tf = Af implies that f is infinitely 
differentiable, that Af” + f= 0, and that f(0) = f(1) = 0. The case 4 = 0 can 
be treated separately. 

(b) Show that the above eigenfunctions form an orthogonal basis for [7(0, 1). 

(c) Suppose g(t) = > c, sin nat. Discuss the equation Tf — Af = g. 

(d) Show that T is also a compact operator on C, the space of all continuous 
functions on [0, 1]. Hint: If {f;} is uniformly bounded, then {Tf;} is equi- 
continuous. 

If 2 = LO, «) relative to Lebesgue measure, and if 


Ths) =* | 79 a (0<s< oo), 
0 


prove that T € AI’) and that T is not compact. (The fact that ||T|| <2 is a 

special case of Hardy’s inequality. See p. 72 of [23].) 

Prove the following statements: 

(a) If {x,} is a weakly convergent sequence in X, then {||x,||} is bounded. 

(b) If T € AX, Y) and x, > x weakly, then Tx, — Tx weakly. 

(c) If T € A(X, Y), if x, > x weakly, and if T is compact, then ||Tx, — Tx|| > 0. 

(d) Conversely, if X is reflexive, if T €¢ @(X, Y), and if || Tx, — Tx|| +0 when- 
ever x, — x weakly, then T is compact. Hint: Use (c) of Exercise 1, and part 
(c) of Exercise 28 in Chapter 3. 

(e) If X is reflexive and T € &(X, ¢'), then T is compact. Hence A(T) # 7@’. 
Hint: Use (c) of Exercise 5 of Chapter 3. 

(f) If Y is reflexive and T € (cy, Y), then T is compact. 

Suppose Y is a closed subspace of X, and x3 € X*. Put 


w= sup {| <x, x$>|: x € Y, [|x| < 1}, 
6 = inf {||x* — x&||: x* e Y*}. 


In other words, y is the norm of the restriction of x% to Y, and 6 is the distance 
from x* to the annihilator of Y. Prove that ~= 0. Prove also that 6 = 
\|x* — x*|| for at least one x* € Y~. 

Extend Sections 4.6 to 4.9 to locally convex spaces. (The word “isometric” must 
of course be deleted from the statement of Theorem 4.9.) 

Let B and B* be the closed unit balls in X and X*, respectively. The following is 
a converse of the Banach-Alaoglu theorem: If E is a convex set in X* such that 
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E « (rB*) is weak*-compact for every r > 0, then E is weak*-closed. (Corollary: 
A subspace of X* is weak*-closed if and only if its intersection with B* is 
weak*-compact.) 
Complete the following outline of the proof. 
(i) E is norm-closed. 
(ii) Associated to each F c X its polar 


P(F) = {x*: | <x, x*>| < 1 for all x € F}. 


The intersection of all sets P(F), as F ranges over the collection of all finite 
subsets of r~'B, is exactly rB*. 

(iii) The theorem is a consequence of the following proposition: If, in addition 
to the stated hypotheses, E ~\ B* = @, then there exists x € X such that 
Re <x, x*» > 1 for every x* € E. 

(iv) Proof of the proposition: Put F, = {0}. Assume finite sets Fy, ..., Fy—4 
have been chosen so that iF; < B and so that 


(1) P(Fo.) 0°: 0 P(UF,_1;) OE <0 kB* = @. 
Note that (1) is true for k = 1. Put 
Q = P(Fy) 0°: O P(Fy_-1) O E20 (k + 1)B*. 


If P(F) 0 Q # @ for every finite set F < k~‘B, the weak*-compactness of 
Q, together with (ii), implies that (kKB*) 7 Q # @, which contradicts (1). 
Hence there is a finite set F, < k~'B such that (1) holds with k + 1 in place 
of k. The construction can thus proceed. It yields 


(2) Eo ()P(F,) = ©. 
k=1 


Arrange the members of |) F, in a sequence {x,}. Then ||x,|| > 0. Define 
T: X* > Cy by 


UD ea kG cee a 
Then T(E) is a convex subset of c,. By (2), 
|| 7x*]]| = sup |<x,, x*>| > 1 


for every x* € E. Hence there is a scalar sequence {a,}, with ) |«,| < 0, 
such that 


Re > a,<x,,x*> <1 
n=1 
for every x* € E. To complete the proof, put x = >) a, x,. 
Suppose T € &(X), T is compact, A # 0, and S = T — AI. 
(a) If W(S") = V(S"*') for some nonnegative integer n, prove that (S") = 
N(S"*") for k = 1, 2, 3,.... 
(b) Prove that (a) must happen for some n. (Hint: Consider the proof of 
Theorem 4.24.) 


23. 


24. 


25. 
26. 


27. 
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(c) Let n be the smallest nonnegative integer for which (a) holds. Prove that 
dim .V(S") is finite, that 


X = N(S") ® A(S"), 


and that the restriction of S to A(S") is a one-to-one mapping of #(S”) onto 
R(S"). 
Suppose {x,} is a sequence in a Banach space X, and 


3 |x, || = M < oo. 
n=1 


Prove that the series ) x, converges to some x € X. Explicitly, prove that 


lim ||x — (x, +--:+x,)|| =0. 
Prove also that ||x|| < M. (These facts were used in the proof of Theorem 4.13.) 
Let c be the space of all complex sequences 
x= {x,, Xo; X3; ay 
for which x, = lim x, exists (in ). Put ||x|]| = sup |x,|. Let co be the subspace 
of c that consists of all x with x, = 0. 
(a) Describe explicitly two isometric isomorphisms u and v, such that u maps c* 
onto ¢* and v maps c¥ onto 7’. 


(b) Define S: cy > c by Sf = f. Describe the operator vS*u~! that maps /! to 7’. 
(c) Define T: c > Cg by setting 


Vi = Xo» Vat t= Me Me ifn> 1. 


Prove that T is one-to-one and that Tc =c,. Find ||T|| and ||T™'|. 

Describe the operator uT*v ‘ that maps 7! to 7'. 
If T €¢ B(X, Y) and &A(T*) = V(T), prove that A(T) is closed. 
Assume T € &(X, Y) and T(X) = Y. Show that there exists 6 > 0 such that 
S(X) = Y for all S ¢ BX, Y) with |S — TI] <6. 
Suppose T € @(X). Prove that A € o(T) if and only if there is a sequence {x,} in 
X, ||x,|| = 1, for which 

lim || Tx, — Ax, || = 0. 


[Thus every 4 € o(T) which is not an eigenvalue of T is an “approximate” 
eigenvalue. | 


CHAPTER 


SOME 
APPLICATIONS 


This chapter contains some applications of the preceding abstract material 
to more concrete problems in analysis. Most of these applications depend 
only on a small part of the contents of Chapters 1 through 4. Here is a 
partial list of the theorems, ordered more or less according to prerequisites. 


Theorems Prerequisites 

5.23 Vector topologies 

5.27 Minkowski functionals (and Brouwer’s fixed point theorem) 
5.1, 5.2 Closed graph theorem 

5.4 Hahn-Banach theorem 

5.5, 5.7, 5.10, 5.11 Banach-Alaoglu and Krein-Milman theorems 

5.18 Banach-Steinhaus theorem and vector-valued integrals 

5.9, 5.21 Closed range theorem 


A Continuity Theorem 


One of the very early theorems in functional analysis (Hellinger and 
Toeplitz, 1910) states that if T is a linear operator on a Hilbert space H 
which is symmetric in the sense that 


(Tx, y) = (x, Ty) 
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for all x € H and y € H, then T is continuous. Here (x, y) denotes the usual 
Hilbert space inner product. (See Section 12.1.) 

If {x,} is a sequence in H such that ||x,|| +0, the symmetry of T 
implies that Tx, + 0 weakly. (This depends on knowing that all continuous 
linear functionals on H are given by inner products.) The Hellinger-Toeplitz 
theorem is therefore a consequence of the following one. 


5.1 Theorem Suppose X and Y are F-spaces, Y* separates points on Y, 
T: X > Y is linear, and ATx, > 0 for every A € Y* whenever x, > 0. Then 
T is continuous. 


PROOF. Suppose x, > x and Tx, — y. If Ae Y*, then 
AT(x, — x) 70 
so that 
Ay = lim ATx, = ATx. 
Consequently, y = Tx, and the closed graph theorem can be applied. 


//// 


In the context of Banach spaces, Theorem 5.1 can be stated as 
follows: If T: X > Y is linear, if ||x,|| + 0 implies that Tx, 0 weakly, then 
||x,,|| 0 actually implies that ||Tx,]|| > 0. 

To see that completeness is important here, let X be the vector space 
of all complex infinitely differentiable functions on (— 00, 00) which vanish 
outside the unit interval, put 


a= | Fa IMl=GAN™, 


and define T: X >~ X by (Tf)(x) = if'(x). Then (Tf, g) =({, Tg), but T 1s 
not continuous. 
Closed Subspaces of L?-Spaces 


The proof of the following theorem of Grothendieck also involves the 
closed graph theorem. 


5.2 Theorem Suppose 0 < p < 00, and 


(a) pisa probability measure on a measure space Q. 
(b) Sis aclosed subspace of [?(). 
(c) Sco L°(w). 


Then S is finite-dimensional. 
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PROOF. Let j be the identity map that takes S into L®, where S is given 
the I[?-topology, so that S is complete. If {f,} is a sequence in S such 
that f, > fin S and f, > g in L®, it is obvious that f= g a.e. Hence j 
satisfies the hypotheses of the closed graph theorem, and we conclude 
that there is a constant K < oo such that 


(1) Ifo SKIS I, 


for all fe S. As usual, || f ||, means ({ | f |? du)'”, and ||f||,, is the 
essential supremum of | f |. If p < 2 then ||f||, < ||fll2. If 2 < p< o, 
integration of the inequality 


FP Ste: We 
leads to || f ||, < K”’*|| f||,. In either case, we have a constant M < co 
such that 
(2) Ifo SMIf ll. UWeS). 


In the rest of the proof we shall deal with individual functions, 
not with equivalence classes modulo null sets. 

Let {¢,,..., @,} be an orthonormal set in S, regarded as a sub- 
space of I?. Let Q be a countable dense subset of the euclidean unit 
ball B of ". Ifc = (cy, ..., c,) € B, define f, = >) c;;. Then || f. ||. < 1, 
and so || f. ||, <M. Since Q is countable, there is a set Q’ <Q, with 
p(Q’) = 1, such that | f.(x)| < M for every c € Q and for every x €. 
If x is fixed, c—| f,.(x)| is a continuous function on B. Hence 
| f.(x)| < M whenever c € B and x € 9’. It follows that ¥ | @x)|* < 
M? for every x € Q’. Integration of this inequality gives n < M*. We 
conclude that dim S < M*. This proves the theorem. //// 


It is crucial in this theorem that L® occurs in the hypothesis (c). To 
illustrate this we will now construct an infinite-dimensional closed subspace 
of L' which lies in I*. For our probability measure we take Lebesgue 
measure on the circle, divided by 27. 


5.3 Theorem Let E be an infinite set of integers such that no integer has 
more than one representation as a sum of two members of E. Let P, be the 
vector space of all finite sums f of the form 


[eo @) 


(1) fe"= ¥ ene” 


n= —o 


in which c(n) = 0 whenever n is not in E. Let S,; be the L'-closure of P,;. Then 
S, is a closed subspace of I’. 
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An example of such a set is furnished by 2*, k = 1, 2, 3,.... Much 
slower growth can also be achieved. 


PROOF. If fis as in (1), then 


f 7(e') mst oy c(n)?e?!"? ate y c(n)c(m)ei" + ™?, 


n n#=m 


Our combinatorial hypothesis about E implies that 


furit= [ier =) le(n)|* +4 d) |e(m)|*| (n)|? 


m<n 


so that 


(2) [iris 2 lamer =2{ firey. 


HOlder’s inequality, with 3 and 3 as conjugate exponents, gives 


(3) {ire < (firs) (fis) 


It follows from (2) and (3) that 
(4) Ifllas24fll, and [fll s 2°? fll 


for every fe P,;. Every L'-Cauchy sequence in P, is therefore also a 
Cauchy sequence in I*. Hence S,< I*. The obvious inequality 
I fll, < | fllq then shows that S, is closed in I’. HI 


An interesting result can be obtained by applying a duality argument 
to the second inequality (4). Recall that the Fourier coefficients g(n) of every 
g € L® satisfy } | g(n)|? < 00. The next theorem shows that nothing more 
can be said about the restriction of g to E. 


5.4 Theorem I[fE is as in Theorem 5.3 and if 
y | a(n) |? = A* < © 
then there exists g € L® such that g(n) = a(n) for everyne E. 


PROOF. If fe P,, the preceding proof shows that 


IDF (n)a(n)| < ACY £()7}2? = All fll, <2? All fle. 


Hence f— )) f (n)a(n) is a linear functional on P, which is continuous 
relative to the L'-norm. By the Hahn-Banach theorem, this functional 
has a continuous linear extension to L’. Hence there exists g € L” 
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(with ||g||,, < 2'/*A) such that 


y f(n)a(n) = — ~ |" f(e"“)g(e") dd (fe Pz). 


With f(e'*) = e'"® (n € E), this shows that g(n) = a(n). //// 


The Range of a Vector-Valued Measure 


We now give a rather striking application of the theorems of Krein-Milman 
and Banach-Alaoglu. 

Let Mt be a o-algebra. A real-valued measure 1 on Wt is said to be 
nonatomic if every set E € M with |1|(E) > 0 contains a set A € Mt with 
0 <|A|(A) <|A|(E). Here |A| denotes the total variation measure of 4; the 
terminology is as in [23]. 


5.55 Theorem Suppose p,,..., p, are real-valued nonatomic measures on 
a o-algebra IR. Define 


HCE) = (u,(E), ..., HE) = (E € MY). 


Then p is a function with domain IN whose range is a compact convex subset 
of R". 


PROOF. Associate to each bounded measurable real function g the 


vector 
rg=(|o djty,..., [¢ iu.) 


in R". Puto =|z,|+--- +|y,|. If g, = 9, ae. [o], then Ag, = Ag2. 
Hence A may be regarded as a linear mapping of [°(c) into R". 

Each yp; is absolutely continuous with respect to o. The Radon- 
Nikodym theorem [23] shows therefore that there are functions 
h,;¢ L'(c) such that du;=h,do (1 <i<n). Hence A is a weak*- 
continuous linear mapping of L°(c) into R"; recall that L°(c) = L’(a)*. 
Put 


= {ge L(o):0<g< lI}. 
It is obvious that K is convex. Since g € K if and only if 
0< | fede | fdo 
for every nonnegative f € I’(c), K is weak*-closed. And since K lies in 


the closed unit ball of L°(o), the Banach-Alaoglu theorem shows that 
K is weak*-compact. Hence A(K) is a compact convex set in R". 
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We shall prove that p(t) = A(K). 

If y, is the characteristic function of a set E « Mi, then vy, € K 
and p(E) = Ag. Thus p(t) < A(K). To obtain the opposite inclusion, 
pick a point p € A(K) and define 


K,={g € K: Ag = p}. 


We have to show that K, contains some x,, for then p = p(E). 

Note that K, is convex; since A is continuous, K, is weak*- 
compact. By the Krein-Milman theorem, K, has an extreme point. 

Suppose gy € K, and gp is not a characteristic function in L”(o). 
Then there is a set E e Mt and an r > 0 such that o(£) > 0 andr< 
go <1—ron E. Put Y = yx,°: Lo). Since o(E) > 0 and o is non- 
atomic, dim Y > n. Hence there exists g € Y, not the zero element of 
[*(c), such that Ag = 0, and such that —r<g<rpr. It follows that 
Jo +g and gy — g are in K,. Thus gp is not an extreme point of K,. 

Every extreme point of K, is therefore a characteristic function. 
This completes the proof. //// 


A Generalized Stone-Weierstrass Theorem 


The theorems of Krein-Milman, Hahn-Banach, and Banach-Alaoglu will 
now be applied to an approximation problem. 


5.6 Definitions Let C(S) be the familiar sup-normed Banach space of 
all continuous complex functions on the compact Hausdorff space S. A 
subspace A of C(S) is an algebra if fg € A whenever fe A and ge A. A set 
E — S is said to be A-antisymmetric if every f € A which is real on E 1s 
constant on E; in other words, the algebra A, which consists of the 
restrictions f|, of the functions fe A to E contains no nonconstant real 
functions. 

For example, if S 1s a compact set in ¢ and if A consists of all fe C(S) 
that are holomorphic in the interior of S, then every component of the 
interior of S is A-antisymmetric. 

Suppose A c C(S), p € S, q € S, and write p ~ q provided that there is 
an A-antisymmetric set E which contains both p and q. It is easily verified 
that this defines an equivalence relation in S and that each equivalence class 
is a closed set. These equivalence classes are the maximal A-antisymmetric 
sets. 


5.7 Bishop’s theorem Let A be a closed subalgebra of C(S). Suppose 
g € C(S) and g|,; € A, for every maximal A-antisymmetric set E. Then g € A. 


Stated differently, the hypothesis on g is that to every maximal 4- 
antisymmetric set E corresponds a function f € A which coincides with g on 
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E; the conclusion is that one f exists which does this for every FE, namely, 
f=g. 
A special case of Bishop’s theorem is the Stone- Weierstrass theorem: 
Suppose that 


(a) Ais aclosed subalgebra of C(S), 

(b) A is self-adjoint (i.e., f € A for all f € A), 

(c) A separates points on S, and 

(d) at every pe S, f(p) # 0 for some f € A. Then A = C(S). 


For in this case the real-valued members f + f of A separate points on 
S. Therefore no A-antisymmetric set contains more than one point. It 
follows that every g € C(S) satisfies the hypothesis of Bishop’s theorem. 


PROOF. The annihilator A* of A consists of all regular complex Borel 
measures on S such that | f du = 0 for every f € A. Define 


K = {we A’: |u| < 1}, 


where || || = | w|(S). Then K is convex, balanced, and weak*-compact, 
by (c) of Theorem 4.3. If K = {0}, then A+ = {0}; hence A = C(S), and 
there is nothing to prove. 

Assume K # {0}, and let be an extreme point of K. Clearly, 
| ul] = 1. Let E be the support of yw; this means that E is compact, that 
| u|(E) = |||, and that E is the smallest set with these two properties. 

We claim: E is antisymmetric. 

Consider an fe A with f|, real; without loss of generality, 
—1</f< 1on E. Define measures o and t by 


do=4(11+f)dp, dtr=41—f) dp. 


Since A is an algebra, o € A* and t € A’. Since 1 +f and 1 —f are 
positive on E, ||a|| > 0, |/t|| > 0, and 


1 
lol] + |tl =3 (0 +fdlul+5 | dl —f) d\n} = HCE) = 1. 


This shows that w is a convex combination of the measures o, = 
o/\|o|| and t, = t/||t||. Both of these are in K. Since p is extreme in K, 
yt = 6,. In other words, 


21 +f) du = loll du. 


Therefore f = 2||o|| — 1 on E, 1.e., f|, is constant. 

This proves our claim. 

If g satisfies the hypothesis of the theorem, it follows that 
) g du = O for every p that is extreme in K, hence for every p in the 
convex hull of these extreme points. Since p> |g du is a weak*- 
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continuous function on K, the Krein-Milman theorem implies that 
| g du = 0 for every p € K, hence for every p € A’. 

Every continuous linear functional on C(S) that annihilates A 
thus also annihilates g. Hence g € A, by the Hahn-Banach separation 
theorem. //// 


Note: If (d) is dropped from the hypotheses of the Stone-Weierstrass 
theorem, then (c) implies that there is at most one py, € S, where f(p,.) = 0 
for every f € A. If this is the case, then the proof shows that A = {fe C(S): 


f (Po) = O}. 


Here is an example that illustrates Bishop’s theorem: 


5.8 Theorem Suppose 


(a) K isacompact subset of R" x ¢ and 
(b) ift=(t,,..., t,) € R", the set 


K, = {ze ¢: (t, z) € K} 
does not separate {. If g € C(K), define g, on K, by g,(z) = g(t, 2). 


Assume that g € C(K), that each g, is holomorphic in the interior of K, 
and that e > 0. Then there is a polynomial P in the variables t,,..., t,, 2 such 
that 


| P(t, Z) ae g(t, z)| <€ 
for every (t, z) € K. 


PROOF. Let A be the closure in C(K) of the set of all polynomials 
P(t, z). Since the real polynomials on R"” separate points, every 
A-antisymmetric set lies in some K,. By Theorem 5.7 it is therefore 
enough to show that to every t € R” corresponds an f € A such that 
Si = 9:- 

Fix t € R". By Mergelyan’s theorem [23] there are polynomials 
P{z) such that 


fe)= YP) (eK) 


and | P,;| < 27‘ if i> 1. There is a polynomial Q on R" that peaks at t, 
in the sense that Q(t) = 1 but | Q(s)| < 1 ifs At and K, 4 @. Consider 
a fixed i > 1. The functions @,, defined on K by 


PnlS, 2) = |Q™(s)P{Z)| 
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form a monotonically decreasing sequence of continuous functions 
whose limit is <2~' at every point of K. Since K is compact, it follows 
that there is a positive integer m,; such that ¢,,(s, z)< 27‘ at every 
point of K. The series 


oO 


f(s, z) = ), O™(s)P{(z) 


i=1 


converges uniformly on K. Hence fe A, and obviously f, = g,. /T// 


Two Interpolation Theorems 


The proof of the first of these theorems involves the adjoint of an operator. 
The second furnishes another application of the Krein-Milman theorem. 

The first one (due to Bishop) again concerns C(S). Our notation is as 
in Theorem 5.7. 


5.9 Theorem Suppose Y is a closed subspace of C(S), K is a compact 
subset of S, and |u|(K)=0 for every we Y*. If g € C(K) and |g| <1, it 
follows that there exists f € Y such that f|, = g and|f|<1onS. 


Thus every continuous function on K extends to a member of Y. In 
other words, the restriction map f— f|, maps Y onto C(K). 

This theorem generalizes the following special case. 

Let A be the disc algebra, 1.e., the set of all continuous functions on 
the closure of the unit disc U in ¢ which are holomorphic in U. Take 
S = T, the unit circle. Let Y consist of the restrictions to T of the members 
of A. By the maximum modulus theorem, Y is a closed subspace of C(T). If 
K c T is compact and has Lebesgue measure 0, the theorem of F. and M. 
Riesz [23] states precisly that K satisfies the hypothesis of Theorem 5.9. 
Consequently, to every g € C(K) corresponds anf € A such that f = g on K. 


PROOF. Let p: Y — C(K) be the restriction map defined by pf=f\,. 
We have to prove that p maps the open unit ball of Y onto the open 
unit ball of C(K). 

Consider the adjoint p*: M(K)— Y*, where M(K) = C(K)* 1s 
the Banach space of all regular complex Borel measures on K, with 
the total variation norm || y|| =|p|(K). For each uw € M(K), p*yu 1s a 
bounded linear functional on Y; by the Hahn-Banach theorem, p* pu 
extends to a linear functional on C(S), of the same norm. In other 
words, there exists o € M(S), with ||o|| = || p*u||, such that 


| Fao = <b ow =<oh w= | fa 
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for every fe Y. Regard yu as a member of M(S), with support in K. 
Then o—peY-, and our hypothesis about K implies that 
o(E) = w(E) for every Borel set E c K. Hence |p| < ||o||. We con- 
clude that || z|| < || p*u||. By (b) of Theorem 4.13, this inequality proves 
the theorem. //// 


Note: Since || p*|| = ||p|| < 1, we also have ||o|| < || u|| in the preceding 
proof. It follows that o = p. Hence p*p has a unique norm-preserving exten- 
sion to C(S). 

Our second interpolation theorem concerns finite Blaschke products, 
1.e., functions B of the form 


Z— Oy 


N 
Biz) = 
vl cll 1 — oz" 
where |c| = 1 and |a,| <1 for 1<k<N. It is easy to see that the finite 
Blaschke products are precisely those members of the disc algebra whose 
absolute value is 1 at every point of the unit circle. 

The data of the Pick-Nevanlinna interpolation problem are two finite 
sets of complex numbers, {z), ..., z,} and {wo, ..., w,,}, all of absolute value 
less than 1, with z; 4 z; if i 4 j. The problem is to find a holomorphic func- 
tion fin the open unit disc U, such that | f(z)| < 1 for all ze U, and such 
that 


f(z;) = w; (0<i<n). 


The data may very well admit no solution. For example, if {z), z,} = 
{0, 4} and {wo, w,} = {0, 4}, the Schwarz lemma shows this. But if the 
problem has solutions, then among them there must be some very nice 
ones. The next theorem shows this. 


5.10 Theorem Let {z9,..., z,}, {Wo, .--. Wa} be Pick-Nevanlinna data. 
Let E be the set of all holomorphic functions f in U such that | f| <1 and 
f(z;) = w; for 0 <i<n. If E is not empty, then E contains a finite Blaschke 
product. 


PROOF. Without loss of generality, assume zp = wy = 0. We will show 
that there is a holomorphic function F in U which satisfies 


(1) Re F(z) > 0 for z € U, F(O) = 1, 


1+w; 
l—w, 


L 


for 1 <i<n, 


(2) F(z;) = B; = 
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and which has the form 


a, + Z 


(3) F(z) = = A a 


where c, > 0, } c, = 1, and |a,| = 1. Once such an F is found, put 
=(F—1)/(F +1). This is a _ finite Blaschke product that 
satisfies B(z;) = w; forO <i <n. 
Let K be the set of all holomorphic functions F in U that 
satisfy (1). 
Associate to each np € M(T) = C(T)* the function 


1 e? 4. 
(4) F(Z) = [ s ai - due (z € U). 
If P is the set of all Borel probability measures on T, then yp F, is a 
one-to-one correspondence between P and K (Theorems 11.9 and 
11.30 of [23]). Define A: M(T) > ¢" by 


(5) Ap = (F (21), «-+5 Fyu(Zy))- 
Since E is assumed to be nonempty, there exists >) € P such that 
(6) Avo = B = (By, «--, B,,). 


Since P is convex and weak*-compact, and since A is linear and 
weak*-continuous, A(P) is a convex compact set in ¢ = R2". Since 
B € A(P), B is a convex combination of N < 2n + 1 extreme points of 
A(P) (Exercise 19, Chapter 3). If y is an extreme point of A(P), then 
A~*(y) is an extreme set of K, and every extreme point of A ~ ‘(y) (their 
existence follows from the Krein-Milman theorem) is an extreme point 


of P. It follows that there are extreme points y,,..., Uy of P and 
positive numbers c, with )° c, = 1, such that 
(7) A(cy My +°°* + Cy My) = B. 


Being an extreme point of P, each pu, that occurs in (7) has a 
single point a, € T for its support; hence 


(8) Fyfe) = 2 


If F is now defined by (3), it follows from (7) and (8) that F satisfies (1) 
and (2). [// 


Kakutani’s Fixed Point Theorem 


Fixed point theorems play an important role in many parts of analysis and 
topology. The one we shall now prove will be used to establish the existence 
of a Haar measure on every compact group. Rather than state it for linear 
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maps, we shall state it in terms of affine maps. These are essentially linear 
maps followed by a translation (Exercise 17), but in the present context they 
need not be defined globally. The following definition makes this precise: 

If K is a convex set, Y is a vector space, and T: K — Y satisfies 


T(1 —A)x + Ay) =(1 —A)Tx + ATy 
whenever x € K, ye K,0 < 4 < 1, then T is said to be affine. 


5.11 Theorem Suppose that 


(a) K is anonempty compact convex set in a locally convex space X, and 
(b) Gis an equicontinuous group of affine maps taking K into K. 


Then G has a common fixed point in K. 


More explicitly, the conclusion is that there exists p € K such that 
Tp = p for very T € G. 

Part (b) of the hypothesis may need some explanation. To say that G 
is a group means that every T € G is a one-to-one map of K into K whose 
inverse T~' also belongs to G (so T maps K onto K!) and that T,T, € G 
whenever 7; € G for i= 1, 2. Here (T, T,)x = T,(T, x), of course; note that 
the composition of two affine maps is affine. 

To say that G is equicontinuous (compare with Section 2.3) means 
now that to every neighborhood W of 0 in X corresponds a neighborhood 
V of O in X such that Tx — Ty € W whenever x € K, ye K, x —ye V, 
and T e G. 

Hypothesis (b) is satisfied, for instance, when G is a group of linear 
isometries on a normed space X. 


PROOF. Let Q be the collection of all nonempty compact convex sets 
H cK such that T(A) c H for every T é€ G. Partially order Q by set 
inclusion. Note that Q # @, since K € Q. By Hausdorffs maximality 
theorem, 2 contains a maximal totally ordered subcollection Q,. The 
intersection Q of all members of Q, is a minimal member of Q. The 
theorem will be proved by showing that Q contains only one point. 

Assume, to the contrary, that there exist xe Q, ye Q,x#y. 
Then there is a neighborhood W of 0 in X such that x — y € W. Let 
V be associated to W as in the preceding definition of equicontinuity. 
If Tx — Ty were in V, for some T eé G, then 


x — y= T (Tx) — T (Ty) 


would be in W, a contradiction. We conclude: 


Forno T € Gis Tx — Tyin V. 
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Put z = 3(x + y). Then z € Q. Define G(z) = {Tz: T € G}. This 
“G-orbit of z” 1s G-invariant (i.e., every T € G maps it into itself), 
hence so is its closure Ky = G(z), and therefore co(K,) is a nonempty 
G-invariant compact convex subset of Q. The minimality of Q implies 
that co(K,) = Q. 

Let p be an extreme point of Q. (It exists, by the Krein-Milman 
theorem.) Since Q is compact and Q = co(K,), Theorem 3.25 shows 
that p lies in the closure Ky of G(z). 

Define a set 


E = {(Tz, Tx, Ty): TeG} <QxQxQ. 


Since p € G(z) and Q x Q is compact, the lemma that is stated below 
shows that there exists a point (x*, y*) € Q x Q so that (p, x*, y*) lies 
in the closure of E. Since 2Tz = Tx + Ty for every T e G, it follows 
that 2p = x* + y*, and this implies that x* = y* because p is an 
extreme point of Q. 

But Tx — Ty ¢ V, for every T € G; hence x* — y* ¢ V; hence 
x* # y*, and we have our contradiction. //// 


Lemma_ Suppose that A and B are topological spaces, B is compact, n is 
the natural projection of A x B onto A,and E< A x B. 

If p € A lies in the closure of n(E), then (p, q) lies in the closure of E for 
some q € B. 


PROOF. If the conclusion fails, then every q € B has a neighborhood 
W, < B so that (V, x W,) 0 E = © for some neighborhood V, of p in 
A, The compactness of B implies that Bc W,, U--: U W,, for some 
finite set {q,,...,q,}. Then V,, 1 --: © V,, is a neighborhood of p 
which does not intersect 2(E), contrary to the assumption that p lies in 
the closure of x(E). IT LT 


Haar Measure on Compact Groups 


5.12 Definitions A topological group is a group G in which a topology 
is defined that makes the group operations continuous. The most concise 
way to express this requirement is to postulate the continuity of the 
mapping @: G x G > G defined by 


P(x, y) = xy". 

For each ae G, the mappings x > ax and x — xa are homeomor- 
phisms of G onto G; so is x >x~*. The topology of G is therefore com- 
pletely determined by any local base at the identity element e. 

If we require (as we shall from now on) that every point of G is a closed 
set, then the analogues of Theorems 1.10 to 1.12 hold (with exactly the same 
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proofs, except for changes in notation); in particular, the Hausdorff separa- 
tion axiom holds. 

If fis any function with domain G, its left translates L, f and its right 
translates R, f are defined, for every s € G, by 


(L, f(x) =f (sx), (Rs f(x) =f (xs) (x € G). 


A complex function f on G is said to be uniformly continuous if to 
every e > O corresponds a neighborhood V of e in G such that 


| f(t) —f(s)| <e 


whenever se G,te G,ands ‘te V. 

A topological group G whose topology is compact is called a compact 
group; in this case, C(G) is, as usual, the Banach space of all complex con- 
tinuous functions on G, with the supremum norm. 


5.13 Theorem Let G be a compact group, suppose f € C(G), and define 
H,(f) to be the convex hull of the set of all left translates of f. Then 


(a) s—L, fis a continuous map from G into C(G), and 
(b) the closure of H,(f) is compact in C(G). 


PROOF. Fix ¢ > 0. Since f is continuous, there corresponds to each 
aeéG a neighborhood W, of e such that | f(x) — f(a)| <e if xa~‘ e 
W,. The continuity of the group operations gives neighborhoods V, of 
e which satisfy V;'V, < W,. Since G is compact, there is a finite set 
A c G such that 


(1) G=\V,-a. 
aca 

Put 

(2) V=()\,. 
acaA 


Choose x, y € G so that yx”! € V, and choose ae A so that ya~* € 


V.. Then | f(y) — f(a)| < ¢, and since xa“! = (xy "*\ya")eE VV 
W,, we also have | f(x) — f(a)| < «. 

Thus | f(x) — f(y)| < 2e whenever yx~' € V. 

For any s € G, (ys\xs)~' = yx~*. Hence yx~* € V implies that 
| f(xs) — f(ys)| < 2. This is just another way of saying that 


whenever y lies in the neighborhood Vx of x. This proves (a). 


1 
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As a consequence of (a), {L, f: x € G} is compact in the Banach 
space C(G). Hence (b) follows from part (c) of Theorem 3.20. //// 


5.14 Theorem On every compact group G exists a unique regular Borel 
probability measure m which is left-invariant, in the sense that 


(1) | tam -| (L.f)dm  [seG,feC(G)]. 
G G 


This m is also right-invariant : 


(2) | fam= | (8. am [se G, fe C(G)] 
G G 


and it satisfies the relation 
(3) | f(x) dm(x) = | f(x" ")dm(x) = [fe C(G)]. 
G G 


This m is called the Haar measure of G. 


PROOF. The operators L, satisfy L, L, = L,,, because 
(LL, f(x) = (Lt fsx) = f(tsx) = (Lis A). 


Since each L, is an isometry of C(G) onto itself, {L,: s € G} is an equi- 
continuous group of linear operators on C(G). If fe C(G), let K, be 
the closure of H,(f). By Theorem 5.13, K, is compact. It is obvious 
that L(K ,) = K, for every s € G. The fixed point theorem 5.11 now 
implies that K, contains a function @ such that L,@ = @ for every 
s € G. In particular, ¢(s) = d¢(e), so that @ is constant. By the defini- 
tion of K,, this constant can be uniformly approximated by functions 
in H,(f). 

So far we have proved that to each f € C(G) corresponds at least 
one constant c which can be uniformly approximated on G by convex 
combinations of left translates of f’ Likewise, there is a constant c’ 
which bears the same relation to the right translates of f/ We claim 
that c’ =. 

To prove this, pick ¢ > 0. There exist finite sets {a;} and {b,} in 
G, and there exist numbers «; > 0, 8B; > 0, with }).« =1 =) B,, such 
that 


<é (x € G) 


(4) Cc 2 a; f(a; x) 
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and 


(5) 


<6é (x € G). 


Ce 2» B; f(xb,) 


Put x = b, in (4); multiply (4) by B;, and add with respect to j. 
The result 1s 


(6) 


<6: 


Cc — 2 a; B; f(a; b;) 


Put x =a; in (5), multiply (5) by «;, and add with respect to i, to 
obtain 


< &. 


(7) C= 2 a; B; f(a; b;) 


Now (6) and (7) imply that c = c’. 

It follows that to each fe C(G) corresponds a unique number, 
which we shall write Mf, which can be uniformly approximated by 
convex combinations of left translates of f; the same Mf 1s also the 
unique number that can be uniformly approximated by convex com- 
binations of right translates of f’ The following properties of M are 
obvious: 


(8) Mf = 0 if f> 0. 
(9) M1 = 1. 
(10) M(of)=aMf — if wis a scalar. 
(11) M(L, f) = Mf = M(R, f) for every sé G. 
We now prove that 
(12) M(f+ 9g) = Mf+ Mg. 


Pick ¢ > 0. Then 


(13) Mf — )/ 4; f(a:x) <ée (xeG) 


for some finite set {a,}< G and for some numbers «;>0 with 
)_ a; = 1. Define 

(14) h(x) = ), %,9(4;). 

Then he K,, hence K, < K,, and since each of these sets contains a 


unique constant function, we have Mh = Mg. Hence there is a finite 
set {b;} < G, and there are numbers 8, > 0 with }) 6; = 1, such that 


(15) Mg — >, By h(b;x) <6& (x € G); 
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by (14), this gives 


(16) <e  (xeG). 


Mg — >. a; B; g(a; b; x) 
i,j 


Replace x by b;x in (13), multiply (13) by £;, and add with respect to j, 
to obtain 


<6& (x € G). 


(17) Mf — 3 a; B; f(a; b; x) 
Thus 


< 2¢ (x € G). 


(18) Mf + Mg — ) w BAS + gaib;x) 


Since )° a; 8; = 1, (18) implies (12). 
The Riesz representation theorem, combined with (8), (9), (10), 
and (12), yields a regular Borel probability measure m that satisfies 


(19) Mf = | fam (fe C(G)); 


properties (1) and (2) follow now from (11). 

To prove uniqueness, let pp be a regular Borel probability 
measure on G which is left-invariant. Since m is right-invariant, we 
have, for every f € C(G), 


ls du = | dm(y) ls (yx) du(x) 
G G G 


= | d(x) | S (yx) dm(y) = | f dm. 
G G G 


Hence pp = m. 
The proof of (3) is similar. Put g(x) = f(x~ +). Then 


|. dm { g(xy~*) dm(x) = { dm(x) | fox) dm(y). 


The two inner integrals are independent of y and x, respectively. 
Hence | g dm = J f dm. //// 


Uncomplemented Subspaces 


Complemented subspaces of a topological vector space were defined in 
Section 4.20; Lemma 4.21 furnished some examples. It is also very easy to 
see that every closed subspace of a Hilbert space is complemented 
(Theorem 12.4). We will now show that some very familiar closed subspaces 
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of certain other Banach spaces are, in fact, not complemented. These exam- 
ples will be derived from a rather general theorem about compact groups of 
operators that have an invariant subspace; its proof uses vector-valued 
integration with respect to Haar measure. 

We begin by looking at some relations that exist between com- 
plemented subspaces on the one hand and projections on the other. 


5.15 Projections Let X be a vector space. A linear mapping P: X > X 
is called a projection in X if 


Pla Pp. 


1.e., if P(Px) = Px for every x € X. 
Suppose P is a projection in X, with null space /(P) and range &(P). 
The following facts are almost obvious: 


(a) AP)=NV(I— P)={xeX: Px =x}. 

(b) W(P)= AT — P). 

(c) AP) A N(P) = {0} and X = AP) + V(P). 

(d) If A and B are subspaces of X such that A m B= {0} and X = A + B, 
then there is a unique projection P in X with A = &(P) and B = VW (P). 


Since (I—P)P=0, AP) _N(I-—P). If xe NW(U—P), then 
x — Px = 0, and so x = Px € &(P). This gives (a); (b) follows by applying 
(a) to I—P. If xe AP) NV(P), then x= Px=0; if xe X, then 
x = Px + (x — Px), and x — Px € W(P). This proves (c). If A and B satisfy 
(d), every xe X has a unique decomposition x = x’+ x”, with x’ € A, 
x” € B. Define Px = x’. Trivial verifications then prove (d). 


5.16 Theorem 


(a) If P is acontinuous projection in a topological vector space X, then 
X = A(P)@ NV (P). 
(b) Conversely, if X is an F-space and if X = A@ B, then the projection P 


with range A and null space B is continuous. 


Recall that we use the notation X = A @B only when A and B are 
closed subspaces of X such that A mn B= {0} and A+ B=X. 


PROOF. Statement (a) is contained in (c) of Section 5.15, except for the 
assertion that Z&(P) is closed. To see the latter, note that 
AP) = VN (I — P) and that I — P is continuous. 
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Next, suppose P is the projection with range A and null space B, 
as in (b). To prove that P is continuous we verify that P satisfies the 
hypotheses of the closed graph theorem: Suppose x, — x and Px, > y. 
Since Px, € A and A is closed, we have y € A, hence y = Py. Since 
x, — Px, € B and B is closed, we have x — ye B, hence Py = Px. It 
follows that y = Px. Hence P is continuous. //// 


Corollary. A closed subspace of an F-space X is complemented in X if 
and only if it is the range of some continuous projection in X. 


5.17 Groups of linear operators Suppose that a topological vector 
space X and a topological group G are related in the following manner: To 
every s € G corresponds a continuous linear operator T,: X — X such that 


T.=1, T,=T,T, (seG,teG); 


also, the mapping (s, x) > T, x of G x X into X is continuous. 
Under these conditions, G is said to act as a group of continuous 
linear operators on X. 


5.18 Theorem Suppose 


(a) X isa Fréchet space, 
(b) Y is acomplemented subspace of X, 


(c) G is a compact group which acts as a group of continuous linear oper- 
ators on X, and 


(d) T(Y)cY for every se G. 


Then there is a continuous projection Q of X onto Y which commutes 
with every T,. 


PROOF. For simplicity, write sx in place of T,x. By (6) and Theorem 
5.16, there is a continuous projection P of X onto Y. The desired 
projection Q is to satisfy s-'Qs = Q for all se G. The idea of the 
proof is to obtain Q by averaging the operators s~ ‘Ps with respect to 
the Haar measure m of G: define 


(1) Ox = | s~*Psx dm(s) (xe X). 


To show that this integral exists, in accordance with Definition 
3.26, put 


(2) f{s)=s"'Psx  (seG). 
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By Theorem 3.27, it suffices to show that f,: G— X is continuous. Fix 
So € G; let U be a neighborhood of f,(s,) in X. Put y = Ps, x, so that 


(3) Soy =f,(So). 


Since (s, Z) > sz is assumed to be continuous, sy has a neighborhood 
V, and y has a neighborhood W such that 


(4) s\(W)cU  ifse V,. 
Also, Sy has a neighborhood V, such that 
(5) Psx € W ifs Ee V,. 


The continuity of P was used here. If se V,; 7 V,, it follows from (2), 
(4), and (5) that f(s) e U. Thus f, is continuous. 

Since G is compact, each f, has compact range in X. The 
Banach-Steinhaus theorem 2.6 implies therefore that {s~ ‘Ps: s € G} is 
an equicontinuous collection of linear operators on X. To every 
convex neighborhood U, of 0 in X corresponds therefore a neighbor- 
hood U, of 0 such that s~'Ps(U,) c U,. It now follows from (1) and 
the convexity of U, that Q(U,) < U,. (See Theorem 3.27.) Hence Q is 
continuous. The linearity of Q is obvious. 

If x e X, then Psx € Y, hence s-/Psx € Y by (d), for every s € G. 
Since Y is closed, Qx € Y. 

If x e Y, then sx e Y, Psx = sx, and so s_‘Psx = x, for every 
s € G. Hence Ox = x. 

These two statements prove that Q is a projection of X onto Y. 
To complete the proof, we have to show that 


(6) Oso = So Q for every So € G. 


Note that s~*Pssy = So(SS,)” 'P(sS9). It now follows from (1) and 
(2) that 


Osyx = | s~+*Pss, x dm(s) 
G 
= [ Sof(SSo) dm(s) 
= | So f(s) dm(s) 
G 


= So | £09 dm(s) = So Qx. 


The third equality is due to the translation-invariance of m; for 
the fourth (moving s, across the integral sign), see Exercise 24 of 
Chapter 3. //// 
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5.19 Examples In our first example, we take X = L’, Y = H'. Here L’ 
is the space of all integrable functions on the unit circle, and H' consists of 
those fe L’ that satisfy f(n) = 0 for all n <0. Recall that f(n) denotes the 
nth Fourier coefficient of f: 


(1) f(r) => i, f(e~"® dds (n=0, +1, +2, ...). 


Note that we write {(8) in place of f(e’*), for simplicity. 

For G we take the unit circle, i.c., the multiplicative group of all 
complex numbers of absolute value 1, and we associate to each e's € G the 
translation operators t, defined by 


(2) (t, f)(O) = f(s + 9). 


It is a simple matter to verify that G then acts on L’ as described in Section 
5.17 and that 


(3) (z, f)*(n) = e™f(n). 


Hence 1t,(H*) = H' for every real s. (See Exercise 12.) 
If H' were complemented in L’, Theorem 5.18 would imply that there 
is a continuous projection Q of L' onto H' such that 


(4) t.0 = Qt, for all s. 


Let us see what such a Q would have to be. 
Put e,(0) = e'”’. Then t,e, = e'"Se,, and 


(5) Ts Qe, os Or, en = e""*Qe,,, 
since Q is linear. The first equality in 
(6) e(Qe,)*(k) = (t, Qe,)*(k) = e”*(Qe,) *(k) 


follows from (3), the second from (5). Hence (Qe,)*(k) = 0 when k # n. Since 
['-functions are determined by their Fourier coefficients, it follows that 
there are constants c, such that 


(7) Qe, = Cnen (n= 0.2 be a2 oak 


So far we have just used (4). Since Qe, e H’ for all n, c, = 0 when 
n <0. Since Of = f for every fe H', c, = 1 when n > 0. Thus Q (if it exists 
at all) is the “natural” projection of ’ onto H', the one that replaces f(n) 
by 0 when n < 0. In terms of Fourier series, 


(8) o( y a, et) a y a, eine 
=O) 0 


To get our contradiction, consider the functions 


(9) f{0) = y piri ind (0<r< 1). 


— @ 
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These are the well-known Poisson kernels. Explicit summation of the series 
(9) shows that f, > 0. Hence 


A fa 1 |” 
(10) Ith =5 | [140)| 0 = = | J,(0) 40 = 1 
an ae TO Ns 
for all r. But 
2 1 
(11) (Q5N0) = dre 9 = 1 —re®: 


Fatou’s lemma implies that ||Qf,||,;— oo as r-1, since [|1—e'*|~' d6 
= oo. By (10), this contradicts the continuity of Q. 

Hence H’ is not complemented in L. 

The same analysis can be applied to A and C, where C is the space of 
all continuous functions on the unit circle, and A consists of those fe C 
that have f(n) = 0 for all n < 0. If A were complemented in C, the operator 
Q described by (8) would be a continuous projection from C onto A. Appli- 
cation of Q to real-valued f € C shows that there is a constant M < oo that 
satisfies 


(12) a 


for every f € A. To see that no such M can exist, consider conformal map- 
pings of the closed unit disc onto tall thin ellipses. 

Hence A is not complemented in C. 

However, the projection (8) is continuous as an operator in I, if 
1 < p< oo. Hence H? is then a complemented subspace of L?. This is a 
theorem of M. Riesz (Th. 17.26 of [23]). 

We conclude with an analogue of (b) of Theorem 5.16; it will be used 
in the proof of Theorem 11.31. 


5.20 Theorem Suppose X is a Banach space, A and B are closed sub- 
spaces of X, and X = A+B. Then there exists a constant y < oo such that 
every x€X has a representation x =a+b, where ae A, be B, and 
lal] + |[5\| < yllxll. 


This differs from (b) of Theorem 5.16 inasmuch as it is not assumed 
that A m B= {0}. 


PROOF. Let Y be the vector space of all ordered pairs (a, b), with 
ae A, b € B, and componentwise addition and scalar multiplication, 
normed by 


l(a, 5) = llall + [5I. 
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Since A and B are complete, Y is a Banach space. The mapping 
A: Y ~X defined by 


A(a, b) =a +b 


is continuous, since ||a + b|| < ||(a, b)||, and maps Y onto X. By the 
open mapping theorem, there exists y < co such that each x € X is 
A(a, b) for some (a, 5) with ||(a, 5)|| < y||x|). H/I/ 


Sums of Poisson Kernels 


Let U and T be the open unit disc and the unit circle in ¢. Let L = L'(T) 
be as in Theorem 5.19, with norm 


IF lls =={ | f(e!)| do. 
Te —f 


Associate to each z € U the Poisson kernel P, € L'(T): 
jee al ig 


Pe) = ae 


It is easy to check that || P,||, = 1 for every z € U. 
Call a set E < U nontangentially dense on T if to every e” € T and to 
every € > 0 there is a point z € E such that 


|z — e'*| < min (e, 2(1 — |z/)). 


There are such sets which have no limit point in U. To construct one, let 
O<r,<r,<---, limr, =1, and place m, equally spaced points on the 
circle r, T, taking m, > 2/(1 —r,). 

It is a rather surprising fact that every f ¢ L’(T) can be represented as 
the sum of a convergent series of multiples of Poisson kernels. This was 
proved by F. F. Bonsall as an application of the closed range theorem. Here 
is his more precise statement: 


5.21 Theorem If {2,,2,,23,...} <U is nontangentially dense on T, 
then to every fe L(T) and every ¢>O0 correspond scalars c, such that 


dD lenl < fll, + e and 


J) GPa 


1 


This turns out to be a special case of the following abstract result: 


5.22 Theorem Let {x,} be a sequence in a Banach space X, with ||x,|| < 
1 for all n, and suppose that there is a 6 > 0 such that 


sup |<x,, x*>| > d||x*| 


n 
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for every x* € X*. If > 0, every x € X can then be represented in the form 


[o 6) 
5 a ie ee im 
n=1 


with 6 2 |c,| < [xl] te. 


PROOF. Define T: #1 > X by Tc ya, fore = 107, Ch, Cain el. 
1 
Then, for every x* € X*, 
C6, Fx SC Te, x" = y CK... X*> 
1 
so that 


[0 6) 
DAG te oe 
1 


< ||T*x*| 


sup, |<x,, x*>|, which is >6||x*|| by assumption. Theorem 4.13 
asserts therefore that T maps the set of all c with }° |c,| < 1/5 onto a 
set that contains the open unit ball of X. 

This proves Theorem 5.22. Let us apply it with X = L’(T), 
xX, =P,,, where {z,} is nontangentially dense on T. Every 
g € L°(T) = L'(T)* has a harmonic extension 


if ||cl||, <1. The supremum of the left side, over all such c, is 


oo (ee ; ; 
Ge) = = | P,(e\g(e®) d0 = <P, 9> 


Since {z,} is nontangentially dense on T, Fatou’s theorem concerning 
nontangential limits of bounded harmonic functions implies that 


sup |<P,,, g>| = sup | G(z,)| = llgllo- 


Therefore Theorem 5.21 is a consequence of Theorem 5.22, with 6 = 1. 


//// 


Two More Fixed Point Theorems 


It is a well-known consequence of the axiom of choice that there is no 
measure on the real line R which is finite on compact sets, not identically 
zero, translation-invariant, and defined on the o-algebra of all subsets of R. 
The usual proof that nonmeasurable sets exist shows this. However, if 
countable additivity—a property that, by definition, measures have—is 
weakened to finite additivity, i.e., to the requirement that 


WE, V+: U E,) = w(E,) + ++: + H(E,) 
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for all finite unions of pairwise disjoint sets E;, then there do exist such 
“finitely additive” measures yp which have all the other above-mentioned 
properties. Moreover, one can have 0 < p(E) < 1 forevery Ec R. 

Theorem 5.25 will prove this, with any abelian group G in place of R, 
as an application of an “invariant” version of the Hahn-Banach theorem. 
The latter will be derived from the surprisingly elementary fixed point 
theorem 5.23, due to Markov and Kakutani: 


5.23. Theorem If K is a nonempty compact convex set in a topological 
vector space X and ¥ is a commuting family of continuous affine maps taking 
K into K, then there exists a point p € K such that Tp = p for every T € F¥. 


proor. For Te ¥, put T' = T, T"*! = T o T", for n= 1, 2, 3,.... 
The fact that the averages 


! 
(1) ge ed ett I) 


are also affine maps of K into K leads to the conclusion that any two 
of them (with possibly different T’s and different n’s) commute with 
each other. 

Let ¥* be the semigroup generated by the maps (1). Thus ¥* is 
the collection of all compositions of finitely many averages (1). If f, 
ge F*,andh=fog=g-f, then he F¥*. Since f(g(K)) <f(K) and 
g( f(K)) < g(K), we see that 


(2) I(K) 1 g(K) > h(K). 


Induction shows therefore that the collection { f(K): fe #*} has the 
finite intersection property. Since each f(K) is compact, there is a 
point p € K which lies in f(K) for every fe F*. 

Now fix T € ¥ and let V be a neighborhood of 0 in X. For 
every n>1, pe T,{K), since T, € #*. This means that there exist 
points x, € K such that 


1 
(3) P= = (Xq_ + TX_ +077 + T" 'Xy), 
But then 
1 1 
(4) p— Tp =~ (x, — T"x,) € = (K — K), 


and K—KcnvV for all sufficiently large n, because K — K is 
compact and therefore bounded. Thus p — Tp é V, for every neigh- 
borhood V of 0. This forces p — Tp to be 0. //// 
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5.24 An invariant Hahn-Banach theorem Suppose that Y is a sub- 
space of a normed linear space X, f € Y*,T < &X), and that 


(a) T(Y)c Yand ST = TS for all S, T € T, 
(b) foT =f, for every T eT. 


Then there exists F € X* such that F=f on Y, ||F\| = ||f ||, and 
Fo T=F forevery T eT. 


Briefly, the given I’-invariant f has a I’-invariant Hahn-Banach exten- 
sion F, 


PROOF. Assume that || f|| = 1, without loss of generality. Define 
(1) K = {Ae X*: |All < 1, A =fon Y}. 


It is clear that K 1s convex. The Hahn-Banach theorem implies that K 
is not empty. Since K is weak*-closed, the Banach-Alaoglu theorem 
shows that K is a weak*-compact subset of X*. For every T eT, the 
map 


(2) A>AcT 


is an affine map of K into K which is weak*-continuous (as we will 
see in a moment). Theorem 5.23 shows therefore that some F € K 
satisfies F o T = F forevery T eT. 

To finish, we show that (2) is a weak*-continuous map of X* into 
X*, for every T € A(X). Fix A, € X*, let 


(3) V = {Le X*: | Lx; —(A, T)x;|<e,1<i<n} 

be a typical weak*-neighborhood of A,T, determined by x,,..., 
x, € X and ¢ > 0. Then 

(4) W = {A e& X*: |A(Tx) — Ay(Tx)| <8, 1 <i<n} 


is a weak*-neighborhood of A,, and if A € W, it is clear that AT e€ V. 
//// 


5.25 Theorem If G is an abelian group (with + as group operation) and 
M is the collection of all subsets of G (the “‘ power set’’ of G), then there is a 
function pn: M — [0, 1] such that 


(a) WE, UV E,)= WE,) + WE.) if Ey 0 E, = ©, 
(b) w(E + a)= WE) for all E€ &@,ae€G,and 
(c) pw(G)=1. 
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PROOF. This is trivial if G is finite. So assume G is infinite, and let 
¢(G) be the Banach space of all bounded complex functions on G, 
with the supremum norm. 

Let Y be the space of all fe ?°(G) which have a limit, call it Af, 
at oo. This means that, if fe Y and e > 0, then there is a finite set 
EcG so that | Af —f(x)| < ¢ for all x outside E. Note that Ae Y* 
and that |/A|] = 1. 

Let I be the set of all translation operators t,, for a € G, defined 
by 


(1) (Ta f(x) = f(x — a). 


Since G is abelian, any two members of F commute; each 1, is a linear 
isometry of ?°(G); and it is clear that t,(Y) c Y and that At, =A 
on Y. 

The hypotheses of Theorem 5.24 are thus satisfied, with 
X = ¢%G). We conclude that there exists an extension L of A, a linear 
functional of norm 1 on #%(G), which satisfies 


(2) Lf=Af — for every fe Y 
and 
(3) Lt,f=Lf for every fe 1°(G). 


If we now define p(E) = Ly, (where y, is the characteristic 
function of EcG), then (a) holds because ype, + Xe, = Xe,vk, if 
E, A E, = @, and Lis linear, and (b) holds because 


(4) XE +a(X) = Xe(x f a) = 1,4 n(x). 


It remains to be shown that 0 < p(E) < 1 for every E < G. This 
is done by the following lemma, since A (hence also L) preserves con- 
stants: If f(x) = c for all x e G, then fe Y and Af=c. //// 


5.26 Lemma _ Suppose that X is a normed linear space of bounded func- 
tions, with the supremum norm, and that L is a linear functional on X, such 
that 


[L|| = LQ) = 1. 
ThenO0< Lf<1liffe X andO<f< 1. 


PROOF. Put Lf = a + if. For every real t, 
L(f-4+it)=a—-—F+ (6 + 0). 
Since || f — 4|| < 4, it follows that 


(a—4P+(P +? <|f—Ftitl? sete, 
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so that a? —a+ f* + 2ft <0 for every real t. This forces B = 0, 
hence «* < a, hence0 <a < 1. a, 


5.27 Example Commutativity cannot be dropped from the hypotheses 
of the preceding three theorems. To see this, let G be the free group on two 
generators, a and b. Except for the identity element, G is the union of four 
disjoint sets, say I, II, III, IV, consisting of those reduced words that start 
with a, a~', b, b+, respectively. If is a finitely additive measure on the 
power set of G, with O< w<1 and p(aE) = p(E) = (bE) for all ECG, 
then we see that “(I U IIT U IV) = wl) and wT VU ‘Iu TD = pl. The 
first of these shows that p(IIT) = p(IV) = 0, the second that p(I) = (II) = 0. 
Since singletons must have measure 0, n = 0. Thus Theorem 5.25 fails for 
this group. 

We conclude this chapter with the Schauder-Tychonoff fixed point 
theorem. This is an infinite-dimensional version of Brouwer’s theorem con- 
cerning the fixed point property of closed balls in R”. It is nonlinear, and its 
proof is therefore not really an application of any of the preceding material, 
except that it will involve a Minkowski functional. 


5.28 Theorem If K is a nonempty compact convex set in a locally convex 
space X, and f: K — K is continuous, then f(p) = p for some p € K. 


PROOF. Assume f fixes no point of K. Its graph 
(1) G = {(x, f(x))e X x X:xeK} 


is then disjoint from the diagonal A of X x X and is compact. Hence 
there is a convex balanced neighborhood V of 0 in X such that 
G+(V x V) misses A. In particular, 


(2) I(x) ExtV (x € K). 


Let uw be the Minkowski functional of V. Theorem 1.36 shows 
that pw is continuous on X and that p(x) < 1 if and only if xe V. 
Define 


(3) a(x) = max {0, 1 — u(x)} (x € X). 
Choose x,,..., X, € K so that the sets x; + V (1 <i <n) cover K, put 


a(x) = a(x — x,), and define 


= a(x) ; 
(4) BRD Gy ec a) (xe K,1<i<n), 


noting that the denominator in (4) is positive for every x € K. 
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Let H = co{x,,..., x,}. Then g, defined by 
(5) g(x) =) Bdx)x; = (x € K), 
1 


is a continuous map from K into the compact finite-dimensional 
simplex H < K. The same is true of go/f. Brouwer’s fixed point 
theorem asserts therefore that there is an x* € H such that 


(6) gf (x*)) = x*. 


Since B(x) = 0 outside x; + V, we see that 


(7) x — g(x) = d B(x\(x—x) (eK) 


is a convex combination of vectors x — x; € V. Thus x — g(x) € V, for 
every x € K. In particular, this is true for x = f(x*). We conclude that 


(8) F(x*) € g(f(x*)) + V = x* + V, 
contrary to (2). //// 
Exercises 
1. Define measures y,, u, on the unit circle by 
du, = cos 6 dé, du, = sin 0 dé 
and find the range of the measure p = (1, p>). 
2. Construct two functions f and g on [0, 1] with the following property: If 
du, =f(x) dx, du, = g(x) dx, w= (Hy, Ha), 
then the range of yu is the square with vertices at (1, 0), (0, 1), (—1, 0), (0, — 1). 
3. Suppose that the hypotheses of Theorem 5.9 are satisfied, that d € C(S), d > 0, 


4. 


g € C(K), and |g| < ¢|x. Prove that there exists fe Y such that f|, =g and 
| f|< on S. Hint: Apply Theorem 5.9 to the space of all functions f/¢, with 
fey. 

Supply the details of the proof that every extreme point of P has its support at a 
single point. (This refers to the end of proof of Theorem 5.10.) 


. Prove the analogues of Theorems 1.10 to 1.12 that are alluded to in Section 


5.12. (Do not assume that G is commutative.) 


. Suppose G is a topological group and H is the largest connected subset of G 


that contains the identity element e. Prove that H is a normal subgroup of G, 
that is, a subgroup that satisfies x" 'Hx = H for every x € G. Hint: If A and B 
are connected subsets of G, so are AB and A™?. 


. Prove that every open subgroup of a topological group is closed. (The converse 


is obviously false.) 


. Suppose m is the Haar measure of a compact group G, and V is a nonempty 


open set in G. Prove that m(V) > 0. 


9. 


10. 


11. 


12. 


13. 


14. 
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Put e,(0) = e'””. Let 7 refer to the Haar measure of the unit circle. Let A be the 
smallest closed subspace of I? that contains e, for n = 0, 1, 2,..., let B be the 
smallest closed subspace of I? that contains e_, + ne, for n = 1, 2, 3,.... Prove 
the following: 

(a) An B= {0}. 

(b) If X = A + B then X is dense in I’, but X # LV. 

(c) Although X = A @ B, the projection in X with range A and null space B is 
not continuous. (The topology of X is, of course, the one that X inherits 
from L?. Compare with Theorem 5.16.) 

Suppose X is a Banach space, P € @(X), OQ € BX), and P and Q are projec- 

tions. 

(a) Show that the adjoint P* of P is a projection in X*. 

(b) Show that ||P — Q|| => 1if PQ = OP and P ¥ Q. 

Suppose P and Q are projections in a vector space X. 

(a) Prove that P + Q is a projection if and only if PO = QP = 0. In that case, 


N(P + Q) = V(P) 0 N(Q), 
AP + Q) = AP) + AQ), 
RP) 0 RQ) = {0}. 


(b) If PQ = QP, prove that PQ is a projection and that 


N(PQ) = N(P) + V(Q), 
A(PQ) = AP) ~ AQ). 
(c) What do the matrices 


1 O d 1 -1 
an 
0 0 0 0 
show about part (b)? 


Prove that the translation operators t, used in Example 5.19 satisfy the conti- 
nuity property described in Section 5.17. Explicitly, prove that 


It.g—ts fll, 70 


ifr>sandg—-finL. 

Use the following example to show that the compactness of G cannot be 
omitted from the hypotheses of Theorem 5.18. Take X = L’ on the real line R, 
relative to Lebesgue measure; fe Y if and only if |p f= 0; G = R with the usual 
topology; G acts on L’ by translation: (t, f(x) = f(s + x). The joint continuity 
property is satisfied (see Exercise 12), t,Y = Y for every s, and Y is com- 
plemented in X. Yet there is no projection of X onto Y (continuous or not) that 
commutes with every t,. 

Suppose S and T are continuous linear operators in a topological vector space, 
and 


T = TST. 


Prove that T has closed range. (See Theorem 5.16 for the case S = I.) 
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15. 


16. 


17. 


18. 


19. 
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Suppose A is a closed subspace of C(S), where S is a compact Hausdorff space; 
suppose p is an extreme point of the unit ball of A*; and suppose f € C(S) is a 
real function such that 


| gf du =0 
for every g € A. Prove that f is then constant on the support of u. (Compare 
with Theorem 5.7.) Show, by an example, that the conclusion is false if the word 
“real” is omitted from the hypotheses. 
Suppose X is a vector space, E< X, T: co(E)—> X is affine, and T(E) c E. 
Prove that T(co(E)) < co(E). (This was tacitly used in the proof of Theorem 
5.11.) 
If X and Y are vector spaces and T: X > Y is affine, prove that T — T(0Q) is 
linear. 
Suppose K is a compact set in a Fréchet space X and f: X > K is continuous. 
Prove that f fixes some point of K. 

Do the same if Q is a convex open set in X, Q>K, and f:Q-—- K is 
continuous. 
Prove the existence of a continuous function f on I = [0, 1] which satisfies the 
equation 


f(x) = | ats (x + 2(t)) dt 
8) 


for all x eI. Hint: Denoting the right side by (Tf\x), show that the set 
{Tf: f € C(I)} is uniformly bounded and equicontinuous and that its closure is 


therefore compact in C(/). Apply Schauder’s fixed point theorem (via Exercise 
18). 
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Introduction 


6.1 The theory of distributions frees differential calculus from certain diffi- 
culties that arise because nondifferentiable functions exist. This is done 
by extending it to a class of objects (called distributions or generalized 
functions) which is much larger than the class of differentiable functions to 
which calculus applies in its original form. 

Here are some features that any such extension ought to have in order 
to be useful; our setting is some open subset of R”: 


(a) Every continuous function should be a distribution. 


(b) Every distribution should have partial derivatives which are again dis- 
tributions. For differentiable functions, the new motion of derivative 
should coincide with the old one. (Every distribution should therefore 
be infinitely differentiable.) 


(c) The usual formal rules of calculus should hold. 
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(d) There should be a supply of convergence theorems that is adequate for 
handling the usual limit processes. 


To motivate the definitions to come, let us temporarily restrict our 
attention to the case n = 1. The integrals that follow are taken with respect 
to Lebesgue measure, and they extend over the whole line R, unless the 
contrary is indicated. 

A complex function f is said to be locally integrable if f is measurable 
and {x |f| < co for every compact K c R. The idea is to reinterpret f as 
being something that assigns the number | f@ to every suitably chosen 
“test function’ @, rather than as being something that assigns the number 
f(x) to each x € R. (This point of view is particularly appropriate for func- 
tions that arise in physics, since measured quantities are almost always 
averages. In fact, distributions were used by physicists long before their 
mathematical theory was constructed.) Of course, a well-chosen class of test 
functions must be specified. 

We let 2 = Y(R) be the vector space of all @¢ € C~(R) whose support 
is compact. Then | f@ exists for every locally integrable f and for every 
go € Z. Moreover, @ is sufficiently large to assure that f is determined (a.e.) 
by the integrals | fd. (To see this, note that the uniform closure of 9 con- 
tains every continuous function with compact support.) If f happens to be 
continuously differentiable, then 


(1) [ro=-| (p € D). 


If fe C~(R), then 


(2) [1% = (—1)' | 16% (pe 9, k = 1, 2, 3, ...). 


The compactness of the support of @ was used in these integrations by 
parts. 

Observe that the integrals on the right sides of (1) and (2) make sense 
whether f is differentiable or not and that they define linear functionals on Q. 

We can therefore assign a “kth derivative” to every f that is locally 
integrable: f is the linear functional on 9 that sends ¢ to (—1)* | fo™. 
Note that f itself corresponds to the functional ¢ > | f@. 

The distributions will be those linear functionals on Y that are contin- 
uous with respect to a certain topology. (See Definition 6.7.) The preceding 
discussion suggests that we associate to each distribution A its “derivative” 
A’ by the formula 


(3) A(g) = -AG') (PED). 


It turns out that this definition (when extended to n variables) has all 
the desirable properties that were listed earlier. One of the most important 
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features of the resulting theory is that it makes it possible to apply Fourier 
transform techniques to many problems in partial differential equations 
where this cannot be done by more classical methods. 


Test Function Spaces 


6.2 The space 2(Q) Consider a nonempty open set Q < R". For each 
compact K <Q, the Fréchet space 9, was described in Section 1.46. The 
union of the spaces Y,, as K ranges over all compact subsets of Q, is the 
test function space BQ). It is clear that DQ) is a vector space, with respect 
to the usual definitions of addition and scalar multiplication of complex 
functions. Explicitly, @ € B(Q) if and only if @ €e C%(Q) and the support of 
@ is acompact subset of 2. 
Let us introduce the norms 


(1) lly = max {| D*P(x)|: x EQ, |a| < Nj, 


for @ € BQ) and N = 0, 1, 2, ...; see Section 1.46 for the notations D*% 
and |a|. 

The restrictions of these norms to any fixed Dy < DQ) induce the same 
topology on Dx as do the seminorms py of Section 1.46. To see this, note that 
to each K corresponds an integer Ny such that K c Ky for all N >No. 
For these N, ||@||y = py(@) if 6 € Dr. Since 


(2) lOlly S lOllwss and PN) < Py+i(); 


the topologies induced by either sequence of seminorms are unchanged if 
we let N start at N, rather than at 1. These two topologies of D, coincide 
therefore; a local base is formed by the sets 


(3) y= |b 9x idly <a} (N = 1, 2, 3,..0. 


The same norms (1) can be used to define a locally convex metrizable 
topology on BQ); see Theorem 1.37 and (b) of Section 1.38. However, this 
topology has the disadvantage of not being complete. For example, take 
n=1, Q=R, pick @ € A(R) with support in [0,1], @ > 0 in (0, 1), and 
define 


Val) = Bx — 1) +5 Oe — 2) $0 = hex — m) 


Then {y,,} is a Cauchy sequence in the suggested topology of Z(R), but 
lim w,, does not have compact support, hence is not in D(R). 

We shall now define another locally convex topology t on AQ) in 
which Cauchy sequences do converge. The fact that this t is not metrizable 
is only a minor inconvenience, as we shall see. 
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6.3 Definitions Let Q be a nonempty open set in R". 


(a) For every compact K <Q, t, denotes the Fréchet space topology of 
Qx, as described in Sections 1.46 and 6.2. 


(b) B is the collection of all convex balanced sets Wc QQ) such that 
Dr OW € Tx for every compact K <Q. 


(c) t is the collection of all unions of sets of the form @+ W, with 
@ € AQ) and W e B. 


Throughout this chapter, K will aways denote a compact subset of Q. 
The following two theorems establish the basic properties of the 
topology t, which is quite different from the one discussed in Section 6.2. 
For example, if {x,,} is a sequence in Q, without limit point in Q, and if {c,,} 
is a Sequence of positive numbers, then the set 
{p € BQ): | OX m)| < Cm for m = 1, 2, 3, ...} 


belongs to f, ie., is a t-neighborhood of 0 in AQ). It is this fact (see 
Theorem 6.5) which forces t-bounded sets (and hence t-Cauchy sequences) 
to be concentrated on a common compact set K <Q, and therefore 
t-Cauchy sequences converge. 


6.4 Theorem 


(a) tisatopology in DQ), and B is a local base for t. 
(b) tmakes QQ) into a locally convex topological vector space. 


PROOF. Suppose V, € t, V, € 1, 6 € V; O V;,. To prove (a), it is clearly 
enough to show that 


(1) e+WeVi nV, 


for some W e B. 
The definition of t shows that there exist ¢; € AQ) and W,e B 
such that 


(2) pEeg+WoVv (=1,2). 


Choose K so that 2, contains ¢,, ¢,, and @. Since Dy M W, is open 
in Dy, we have 


(3) -— o € (1 — 6)W, 
for some 6; > 0. The convexity of W, implies therefore that 


(4) @—o,+ 0,W,c (1 — 6,)W, + 6; W, = W,, 
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so that 
(5) @+0,W,co,+W,cV, (=1, 2). 


Hence (1) holds with W = (6,W,) 7 (6, W,), and (a) is proved. 
Suppose next that @, and @, are distinct elements of B(Q), and 
put 


(6) W = {6 € BQ): |bllo < loi — Pallo}, 


where ||@||> is as in (1) in Section 6.2. Then W e B and @, is not in 
od, + W. It follows that the singleton {@,} is a closed set, relative to t. 

Addition is t-continuous, since the convexity of every W e« B 
implies that 


(7) (W¥,+oW)+(¥.+5W)=(¥,+4.)+W 


for any Wy, € AQ), Ww, € AM). 

To deal with scalar multiplication, pick a scalar a) and a ¢) € 
Q(Q). Then 
(8) AP — Hy Po = HP — Go) + (% — Aq) ho. 


If W e€ B, there exists 6 > 0 such that 6¢, ¢ $W. Choose c so that 
2c(|% | + 6) = 1. Since W is convex and balanced, it follows that 


(9) 1p — A Poe W 
whenever |« — a | < 6 and d — dy € cW. 
This completes the proof. //// 


Note: From now on, the symbol AQ) will denote the topological 


vector space (HQ), t) that has just been described. All topological concepts 
related to Z(Q) will refer to this topology t. 


6.55 Theorem 


(a) 
(b) 


(c) 


(d) 


A convex balanced subset V of BQ) is open if and only if V é€ B. 
The topology tx of any Dr < HQ) coincides with the subspace topology 
that Dx inherits from BQ). 
If E is a bounded subset of BQ), then E < Dy for some K <Q, and 
there are numbers My < © such that every $@ € E satisfies the inequal- 
ities 

Illy < My (N = 0, 1, Dita), 
Q(Q) has the Heine-Borel property. 
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(e) 


(f) 


(9) 
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If {o;} is a Cauchy sequence in BQ), then {p;} < Dx for some compact 
K <Q, and 


lim OF == Pilly = 0 (N aa 0, 1, Zi vay)h 

i, joo 
If @; > 0 in the topology of BQ), then there is a compact K < Q which 
contains the support of every $;, and D*f; > 0 uniformly, as i— oo, for 
every multi-index «a. 


In BQ), every Cauchy sequence converges. 


Remark. In view of (b), the necessary conditions expressed by (c), (e), 
and (f) are also sufficient. For example, if E < Dy and ||¢||y < My < 
oo for every ¢ € E, then E is a bounded subset of D, (Section 1.46), 
and now (b) implies that E is also bounded in DQ). 


PROOF. Suppose first that V € t. Pick ¢ € Dy Mm V. By Theorem 6.4, 
@+WcV for some W e Bf. Hence 


P+(QreAW) Ra V. 
Since Dy © W is open in Bx, we have proved that 
(1) Dr AVETx ifVetandKcQ. 


Statement (a) is an immediate consequence of (1), since it is 
obvious that B c t. 

One half of (b) is proved by (1). For the other half, suppose 
Eeétx. We have to show that E= 2, V for some V et. The 
definition of t, implies that to every @ € E correspond N and 6 > 0 
such that 


(2) {We Dg: IW — bly <d} CE. 
Put W, = {W € BQ): Wily < 6}. Then W, e€ B, and 
(3) DrA(d + Ws) = 9 +(BrOW,) CE. 


If V is the union of these sets @ + W,, one for each @ € E, then V has 
the desired property. 

For (c), consider a set E < YQ) which lies in no Y,. Then there 
are functions @¢,,¢ E and there are distinct points x,, € Q, without 
limit point in Q, such that ¢,(x,,) # 0 (m = 1, 2, 3, ...). Let W be the 
set of all 6 € AQ) that satisfy 


(4) PX) < m7 "|Op(X,)| (m= I, 2, 3, ...). 


Since each K contains only finitely many x,,, it 1s easy to see that 
Dy, AW etx. Thus W eé f. Since @¢,, € mW, no multiple of W con- 
tains E. This shows that E is not bounded. 
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It follows that every bounded subset E of DQ) lies in some D,. 
By (b), E is then a bounded subset of Z,. Consequently (see Section 
1.46) 


(5) sup {|ldlly: 6 € E} < a (N= O02 1,2 anc): 


This completes the proof of (c). 

Statement (d) follows from (c), since Z, has the Heine-Borel 
property. 

Since Cauchy sequences are bounded (Section 1.29), (c) implies 
that every Cauchy sequence {¢;} in A) lies in some Dx. By (b), {¢;} 
is then also a Cauchy sequence relative to t,. This proves (e). 

Statement (/f) is just a restatement of (e). 

Finally, (g) follows from (b), (e), and the completeness of D,. 
(Recall that DZ, is a Fréchet space.) //// 


6.6 Theorem Suppose A is a linear mapping of QQ) into a locally 
convex space Y. Then each of the following four properties implies the others: 


(a) 
(b) 
(c) 
(d) 


A is continuous. 

A is bounded. 

If 6; ~0 in DQ) then Ad; > O in Y. 

The restrictions of A to every Bx < BQ) are continuous. 


PROOF. The implication (a) — (b) is contained in Theorem 1.32. 

Assume A is bounded and ¢;—-0 in QQ). By Theorem 6.5, 
g; > 0 in some Q&,x, and the restriction of A to this D, is bounded. 
Theorem 1.32, applied to A: Jy — Y, shows that Ad; > 0 in Y. Thus 
(b) implies (c). 

Assume (c) holds, {¢;} < Dg, and ¢;-0 in Dx. By (b) of 
Theorem 6.5, ¢; ~ 0 in BQ). Hence (c) implies that Ad; > 0 in Y, as 
i oo. Since D, is metrizable, (d) follows. 

To prove that (d) implies (a), let U be a convex balanced neigh- 
borhood of 0 in Y, and put V = A~‘(U). Then V is convex and bal- 
anced. By (a) of Theorem 6.5, V is open in AQ) if and only if 9, nV 
is open in ,, for every Dy < DQ). This proves the equivalence of (a) 


and (d). H// 
Corollary. Every differential operator D*® is a continuous mapping of 
AQ) into BQ). 

PROOF. Since ||D*|ly < |lOlly+)., for N = 0, 1, 2, ..., D* is continuous 


on each Dx. //// 
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6.7 Definition A linear functional on QQ) which is continuous (with 
respect to the topology t described in Definition 6.3) is called a distribution 
in QQ. 

The space of all distributions in Q is denoted by J’(Q). 

Note that Theorem 6.6 applies to linear functionals on GQ). It leads 
to the following useful characterization of distributions. 


6.8 Theorem [fA is a linear functional on BQ), the following two condi- 
tions are equivalent: 


(a) AE D(Q). 
(b) To every compact K <Q corresponds a nonnegative integer N and a 
constant C < oo such that the inequality 


|Ag| < Cllolln 
holds for every @ € Dx. 


PROOF. This is precisely the equivalence of (a) and (d) in Theorem 6.6, 
combined with the description of the topology of DZ, by means of the 
seminorms ||@||, given in Section 6.2. //// 


Note: If A is such that one N will do for all K (but not necessarily 
with the same C), then the smallest such N 1s called the order of A. If no N 
will do for all K, then A is said to have infinite order. 


6.9 Remark Each x € determines a linear functional 6, on BQ), by 
the formula 


OLD) = (>). 


Theorem 6.8 shows that 6, 1s a distribution, of order 0. 

If x = 0, the origin of R”, the functional 6 = 6, 1s frequently called the 
Dirac measure on R". 

Since Y,, for K <Q, is the intersection of the null spaces of these 6, , 
as xX ranges over the complement of K, it follows that each Dx 1s a closed 
subspace of GQ). [This follows also from Theorem 1.27 and part (b) of 
Theorem 6.5, since each Z, is complete.] It is obvious that each D, has 
empty interior, relative to AQ). Since there is a countable collection of sets 
K; < Q such that DQ) = |) Dx,, A) is of the first category in itself. Since 
Cauchy sequences converge in BQ) (Theorem 6.5), Baire’s theorem implies 
that D(Q) is not metrizable. 
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Calculus with Distributions 


6.10 Notations As before, Q will denote a nonempty open set in R”. If 


a =(a,,...,a,) and Pp = (f,,..., B,) are multi-indices (see Section 1.46) then 
(1) laJ=a,+°''+4,, 
(2) DP = Dy 22" DS, where D; = = 
Ox; 
(3) B<ameans f; <a; for 1 <i<n, 
(4) atP=(%,+f;,...,%, + B,). 
If x € R” and y € R", then 
(5) XY Xp to + Xan 
(6) [x] = (x » x)? = (xp + + Rh. 


The fact that the absolute value sign has different meanings in (1) and 
in (6) should cause no confusion. 
If x € R” and «@ is a multi-index, the monomial x* is defined by 


(7) Dale Glens, Or 


6.11 Functions and measures as distributions Suppose f is a locally 
integrable complex function in Q. This means that fis Lebesgue measurable 
and {x |f(x)| dx < oo for every compact K <Q; dx denotes Lebesgue 
measure. Define 


(1) Ap) = I P(x) f(x) dx [pe AQ)]. 
Since 
(2) |A,(?)| < (| if) “Plo (Pe Dy), 


Theorem 6.8 shows that A, € Z(Q). 

It is customary to identify the distribution A, with the function f and 
to say that such distributions “are” functions. 

Similarly, if 4 1s a complex Borel measure on Q, or if pu is a positive 
measure on 2 with p(K) < oo for every compact K c Q, the equation 


(3) AP) = [ edu [pe BOQ)] 


defines a distribution A, in Q, which is usually identified with yp. 


158 part II: DISTRIBUTIONS AND FOURIER TRANSFORMS 


6.12 Differentiation of distributions If a is a multi-index and 
A € 9(Q), the formula 


(1) (D*A\(o) =(—HA(D*9) — [¢ € BO)] 


(motivated in Section 6.1) defines a linear functional D*A on AQ). If 


(2) |Ag| < Clldlly 
for all 6 € Dy, then 


(3) | (D°A\(P)| < C||D* lly < Cl Ollw+ia- 
Theorem 6.8 shows therefore that D*A € 2’(Q). 
Note that the formula 
(4) D*D®A = D***®A = DFD*A 
holds for every distribution A and for all multi-indices « and f, simply 
because the operators D* and D* commute on C%(Q): 
(D*D*A\(p) = (— 1)""(DPA\D*9) 

=(- 1)!2! + | A(D®D*¢) 

= (— 1)**FA(D*** 9) 

= (D**A\(4¢). 


6.13 Distribution derivatives of functions The ath distribution deriv- 
ative of a locally integrable function fin Q is, by definition, the distribution 
D*A,. 

If D’f also exists in the classical sense and is locally integrable, then 
D°f is also a distribution in the sense of Section 6.11. The obvious consis- 
tency problem is whether the equation 


always holds under these conditions. 
More explicitly, the question is whether 


(2) (=) I I (x)(D*P\x) dx = [ (oN 400 dx 


for every @ € DQ). 

If f has continuous partial derivatives of all orders up to N, integra- 
tions by part give (2) without difficulty, if |a|< N. 

In general, (1) may be false. The following example illustrates this, in 
the case n = 1. 


6.14 Example Suppose 2 is a segment in R, and f is a left-continuous 
function of bounded variation in 9. If D =d/dx, it is well known that 
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(Df (x) exists a.e. and that Df € L’. We claim that 


(1) DA, =A, 
where p is the measure defined in Q by 
(2) u(La, b)) = f(b) — f(a). 


Thus DA; = Ap, if and only if f is absolutely continuous. 
To prove (1), we have to show that 


(A, XP) = (DA sd) = —A,(D¢) 
for every @ € DQ), that is, that 


(3) { odu= — [ P(x) f (x) dx. 


But (3) is a simple consequence of Fubini’s theorem, since each side of (3) is 
equal to the integral of #’(x) over the set 


(4) {(x, y: x EQ, yEQ x < y} 


with respect to the product measure of dx and dy. The fact that @ has 
compact support in Q is used in this computation. 


6.15 Multiplication by functions Suppose A « 9(Q) and fe C%(Q). 
The right side of the equation 


(1) (FAG) = AFP) = [ep € BQ)] 

makes sense because fp € AQ) when @ € BQ). Thus (1) defines a linear 

functional fA on BQ). We shall see that fA is, in fact, a distribution in Q. 
Observe that the notation must be handled with care: If f€ QQ), then 


Afis a number, whereas fA is a distribution. 
The proof that fA € Z’(Q) depends on the Leibniz formula 


(2) D*( fg) = 2 Cap(D*~ "f )(D°Q), 


valid for all f and g in C%(Q) and all multi-indices a, which is obtained by 
iteration of the familiar formula 


(3) (uv) = u'v + ww’. 
The numbers c,, are positive integers whose exact value is easily computed 
but 1s irrelevant to our present needs. 

To each compact K <Q correspond C and N such that |Ad| < 


C\ld||y for all @ € Dy. By (2), there is a constant C’, depending on f, K, and 
N, such that || fO||y < C’||Olly for ¢ € Dx. Hence 


(4) I(FAMP)| < CC'llblln = (@ € Dy). 
By Theorem 6.8, fA € J’(Q). 
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Now we want to show that the Leibniz formula (2) holds with A in 
place of g, so that 


(5) D*(fA) = Y) Cap(D* "f(D? A). 
Bsa 
The proof is a purely formal calculation. Associate to each u € R” the func- 
tion h,, defined by 
h(x) = exp (u- x). 


Then D*h, = u*h,. If (2) is applied to h, and h, in place of f and g, the 
identity 


(6) (u+v)*= > cygu* Pv? (ue R", v € R’) 


Bsa 


is obtained. In particular, 
u*=[v+(—v+u4)]’ 


= y Cui px 4 d col 1)!8— Vy B— ry? 
ys 


Bsa 
rea y (— 1) pty? ~ (— 1)le. re 
Ysa y<B<a 
Hence 
1) ify=a 
_ 118 = ( : 
) Pa CoB By ‘0 otherwise. 
Apply (2) to D¥(¢D*~ *f), and use (7), to obtain the identity 
(8) » (— 1)!"!c,g D9(PD* °F) = (— 1)'"'D*6. 
Bsa 


The point of all this is that (8) gives (5). For if @¢ € DQ), then 
D*( fA\(p) = (— 1)'*( fA\(D*o) = (— 1)! A(fD"9) 
= ))(-1)?lezg A(D*(pD**f)) 


Bsa 


= ), Cap(D AY PD* ‘f) 


B<a 


= » Capl(D* "f (DEA) \(9). 


6.16 Sequences of distributions Since J’(Q) is the space of all contin- 
uous linear functions on AQ), the general considerations made in Section 
3.14 provide a topology for Z’(Q)—its weak*-topology induced by Z(Q)}— 
which makes 9'(Q) into a locally convex space. If {A;} is a sequence of 
distributions in Q, the statement 


(1) A, > A in 9'(Q) 
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refers to this weak*-topology and means, explicitly, that 
(2) lim A;¢@ = Ad [d@ € DQ)]. 


In particular, if { f;} is a sequence of locally integrable functions in Q, 
the statements “f; > A in ’(Q)” or “ { f;} converges to A in the distribution 
sense” mean that 


(3) lim | P(x) fix) dx = A@ 


imo JQ 


for every @d € HQ). 
The simplicity of the next theorem, concerning termwise differentia- 
tion of a sequence, is rather striking. 


6.17 Theorem Suppose A, € 9’(Q) for i = 1, 2, 3,..., and 
(1) Ag = lim A, 


exists (as a complex number) for every @ € BQ). Then A € B(Q), and 
(2) D°A; > D*A in BQ), 


for every multi-index «a. 


PROOF. Let K be an arbitrary compact subset of Q. Since (1) holds for 
every @ € Dx, and since Y, is a Fréchet space, the Banach-Steinhaus 
theorem 2.8 implies that the restriction of A to D, is continuous. It 
follows from Theorem 6.6 that A is continuous on AQ); in other 
words, A € JQ). Consequently (1) implies that 


(D*A\(b) = (— 1)!A(D*9) 
= (— 1)" lim A({D*¢) = lim (D*A,\(). /[/// 


i 0c io 


6.18 Theorem I[fA,;— A in 9(Q) and g; > g in C%(Q), then g;A;— gA in 
BQ). 


Note: The statement “g;-g in C%(Q)” refers to the Fréchet space 
topology of C°(Q) described in Section 1.46. 


PROOF. Fix ¢ € AQ). Define a bilinear functional B on C®°(Q) x YQ) 
by 


Big, A) = (gA\(¢) = A(G9). 
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Then B is separately continuous, and Theorem 2.17 implies that 
B(g;, Ai) > Bg, A) as 1 > 00. 
Hence 


(9: AM) > (GAN). //// 


Localization 


6.19 Local equality Suppose A; €¢ AQ) (i= 1, 2) and w is an open 
subset of Q. The statement 


(1) A, =A, in 


means, by definition, that A,@ = A, ¢ for every @ € Aa). 

For example, if f is a locally integrable function and yp is a measure, 
then A, = 0 in @ if and only if f(x) = 0 for almost every x € w, and A, = 0 
in w if and only if w(E) = 0 for every Borel set E < . 

This definition makes it possible to discuss distributions locally. On 
the other hand, it is also possible to describe a distribution globally if its 
local behavior is known. This is stated precisely in Theorem 6.21. The proof 
uses partitions of unity, which we now construct. 


6.20 Theorem [fT is a collection of open sets in R" whose union is Q, 
then there exists a sequence {p;} < BQ), with w; = 0, such that 


(a) each w, has its support in some member of I, 


(b) y" W(x) = 1 for every x € Q, 


i=1 
(c) to every compact K <Q correspond an integer m and an open set 
W > K such that 


(1) W(x) +77 + W(X) = 1 
for allx € W. 


Such a collection {y;} is called a locally finite partition of unity in Q, 
subordinate to the open cover I of Q. Note that it follows from (b) and (c) 
that every point of Q has a neighborhood which intersects the supports of 
only finitely many w;. This is the reason for calling {w;} locally finite. 


PROOF. Let S be a countable dense subset of Q. Let {B,, B,, B;,...} 
be a sequence that contains every closed ball B; whose center p; lies in 
S, whose radius r; is rational, and which lies in some member of I. Let 
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V, be the open ball with center p; and radius r,/2. It is easy to see that 
Q=() %,. 

The construction described in Section 1.46 shows that there are 
functions @; € AQ) such that O< ¢@ <1, ¢;=1 in V,, o; = 0 off B;. 
Define y, = ¢,, and, inductively, 


(2) Wier =U-—O) °° U—O)bi41 (2D). 
Obviously, wy; = 0 outside B,. This gives (a). The relation 
(3) Witeoc+y,;=1-C—¢,):::U-—¢) 


is trivial when i = 1. If (3) holds for some i, addition of (2) and (3) 
yields (3) with i+ 1 in place of i. Hence (3) holds for every i. Since 
o; = 11n V,, it follows that 


(4) Wi(x)+--- + y,(x) = 1 ifxeV, U-':: U V,,. 
This gives (b). Moreover, if K is compact, then K c V, U--- U V,, for 
some m, and (c) follows. iii] 


6.21 Theorem Suppose I is an open cover of an open set Q < R", and 
suppose that to each w € TF corresponds a distribution A,, € B'(@) such that 
(1) A, = Ay in a’ Ao” 
whenever w’ 1 w” # O. 

Then there exists a unique A € 9'(Q) such that 
(2) A=A, inw 


for everyoel. 


PROOF. Let {w;} be a locally finite partition of unity, subordinate to I, 
as in Theorem 6.20, and associate to each i a set w; € I’ such that a; 
contains the support of y;. 

If ¢ € BQ), then d = > Wd. Only finitely many terms in this 
sum are different from 0, since @ has compact support. Define 


(3) Ag = 2 Aol ?). 


It is clear that A is a linear functional on DQ). 

To show that A is continuous, suppose ¢,; > 0 in AQ). There is 
a compact K <Q which contains the support of every ¢,. If m is 
chosen as in part (c) of Theorem 6.20, then 


4) Abj= DAWG) GF =1,23,..) 
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Since W,¢; > 0 in D(q;), as j > 00, it follows from (4) that Ag; > 0. By 
Theorem 6.6, A € 9(Q). 
To prove (2), pick @ € D(w). Then 


(5) Wid € Dw; 0 @) Ue eee 
so that (1) implies A,,(W; ) = A,(W; ¢). Hence 
(6) Ad = Aww d) = Aw, Wid) = Aad, 


which proves (2). 
This gives the existence of A. The uniqueness is trivial since (2) 
(with w; in place of w) implies that A must satisfy (3). //// 


Supports of Distributions 


6.22 Definition Suppose A ¢€ 9’(Q). If w is an open subset of Q and if 
Ad = 0 for every ¢ € D(w), we say that A vanishes in w. Let W be the union 
of all open w <Q in which A vanishes. The complement of W (relative to 
Q) is the support of A. 


6.23 Theorem If W is as above, then A vanishes in W. 


PROOF. W is the union of open sets w in which A vanishes. Let I be 
the collection of these w’s, and let {y;} be a locally finite partition of 
unity in W, subordinate to I’, as in Theorem 6.20. If ¢ € D(W), then 
¢ =). Ww; ¢. Only finitely many terms of this sum are different from 0. 
Hence 


Ad =) Aid) = 0 


since each wy; has its support in some w € IT. /T// 


The most significant part of the next theorem is (d). Exercise 20 com- 


plements it. 


6.24 Theorem Suppose A € 9(Q) and S, is the support of A. 


(a) 
(b) 
(c) 


(d) 


If the support of some @ € BQ) does not intersect S,, then Ad = 0. 

If S, is empty, then A = 0. 

If we C%°Q) and Ww =1 in some open set V containing S,, then 
WA =A. 

If S, is a compact subset of Q, then A has finite order ; in fact, there is a 
constant C < o and a nonnegative integer N such that 


|Ad| < Clidlly 
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for every @ € BQ). Furthermore, A extends in a unique way to a contin- 
uous linear functional on C®(Q). 


PROOF. Parts (a) and (b) are obvious. If w is as in (c) and if @ € D(Q), 
then the support of @¢— wd does not intersect S,. Thus Ad = 
A(Wo) = (WAY(Q), by (a). 

If S, is compact, it follows from Theorem 6.20 that there exists 
we QQ) that satisfies (c). Fix such a w; call its support K. By 
Theorem 6.8, there exist c, and N such that |Ad| <c,||¢||y for all 
@ € D,. The Leibniz formula shows that there is a constant c, such 
that ||Wd||y <c,||||y for every 6 € BQ). Hence 


[A@| = |AWA)| Sc lWblly < cical Olly 


for every d € AQ). 
Since Ad = A(wW@) for all @ € DQ), the formula 


(1) Mf=AWf) [fe crQ)] 


defines an extension of A. This extension is continuous, for if f; > 0 in 
C~(Q), then each derivative of f, tends to 0, uniformly on compact 
subsets of Q; the Leibniz formula shows therefore that Wf; 0 in 
QQ); since A € B’(Q), it follows that Af; > 0. 

If fe C°Q) and if Ky is any compact subset of Q, there exists 
@ € AQ) such that @ =f on Kg. It follows that DQ) is dense in 
C~(Q). Each A € 9’(Q) has therefore at most one continuous exten- 
sion to C%(Q). //// 


Note: In (a) it is assumed that @ vanishes in some open set contain- 
ing S,, not merely that @ vanishes on S,. 

In view of (b), the next simplest case is the one in which S, consists of 
a single point. These distributions will now be completely described. 


6.25 Theorem Suppose A € 9'(Q), p € Q, {p} is the support of A, and A 
has order N. Then there are constants c, such that 


(1) A= ) c,D*6,, 


lal<N 
where 6, is the evaluation functional defined by 
(2) OA?) = Pp). 
Conversely, every distribution of the form (1) has {p} for its support 


(unless c, = 0 for all a). 


PROOF. It is clear that the support of D*6, is {p}, for every multi-index 
a. This proves the converse. 
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To prove the nontrivial half of the theorem, assume that p = 0 
(the origin of R"), and consider a @ € AQ) that satisfies 


(3) (D*o)\(0) = 0 for all « with |a| < N. 


Our first objective is to prove that (3) implies Ad = 0. 
If 7 > 0, there is a compact ball K <Q, with center at 0, such 


that 

(4) |D*p| <n in K, if |o| = N. 

We claim that 

(5) | D*6(x)| < qn~|x|N~'*l (x € K, |a| SN). 


When |a|=N, this is (4). Suppose 1 <i< N, assume (5) is 
proved for all « with |a| = i, and suppose | B| = i — 1. The gradient of 
D*¢ is the vector 


(6) grad D’¢ = (D,D*¢, ..., D, D’¢). 
Our induction hypothesis implies that 
(7) |(grad D’p\(x)| <n-ynX'|x|*"! (x € K), 


and since (D’¢)(0) = 0 the mean value theorem now shows that (5) 
holds with f in place of «. Thus (5) is proved. 

Choose an auxiliary function w € A(R"), which is 1 in some 
neighborhood of 0 and whose support is in the unit ball B of R". 
Define 


(8) W(x) = (*) (r > 0, x © R’). 


If r is small enough, the support of w, lies in rB < K. By Leibniz’ 
formula 


) Di, #0) = © cafD* PU © DFO =H 
It now follows from (5) that 


(10) IY Olly SnCllW lly 


as soon as r is small enough; here C depends on n and N. 

Since A has order N, there is a constant C, such that |Aw| < 
C,\lWily for all W €e D,. Since w, = 1 in a neighborhood of the support 
of A, it now follows from (10) and (c) of Theorem 6.24 that 


[AG] = [AW )| S Crile Olly S nCCi IY lly. 


Since 4 was arbitrary, we have proved that A@ =O whenever (3) 
holds. 
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In other words, A vanishes on the intersection of the null spaces 
of the functionals D*6, (|a«| < N), since 


(11) (D*59)$ = (—1)!*'59(D*¢) = (— 1)"(D*)(0). 


The representation (1) follows now from Lemma 3.9. /T// 


Distributions as Derivatives 


It was pointed out in the introduction to this chapter that one of the aims 
of the theory of distributions is to enlarge the concept of function in such a 
way that partial differentiations can be carried out unrestrictedly. The dis- 
tributions do satisfy this requirement. Conversely—as we shall now see— 
every distribution is (at least locally) D*f for some continuous function f and 
some multi-index «. If every continuous function is to have partial deriv- 
atives of all orders, no proper subclass of the distributions can therefore be 
adequate. In this sense, the distribution extension of the function concept is 
as economical as it possibly can be. 


6.26 Theorem Suppose A € 9'(Q), and K is a compact subset of Q. Then 
there is a continuous function f in Q and there is a multi-index « such that 


(1) Ag = (—1)'" | F(x(D*)(x) dx 


for every pb € Dx. 


PROOF. Assume, without loss of generality, that K < Q, where Q is the 


unit cube in R"”, consisting of all x = (x,, ..., x,) with 0 < x; < 1 for 
i= 1,...,n. The mean value theorem shows that 
(2) |w| < max |(D;¥\(x)| (WE Do) 

xeQ 


fori=1,...,n. Put T=D,D,--- D,. For ye Q, let Q(y) denote the 
subset of Q in which x; < y; (1 <i <n). Then 


(3) Wiy)= | (TY)x)dx (Ww € Do). 


Q(y) 


If N is a nonnegative integer and if (2) is applied to successive deriv- 
atives of w, (3) leads to the inequality 


(4) Wily < max |(T™W)(x)| < | |(T™ * SW) | dx, 
Q 


xeQ 


for every W € Do. 
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Since A € Y’(Q), there exist N and C such that 


(5) |A@|<Clldlly (© Dx). 
Hence (4) shows that 


(6) |A¢| < [1cr™6001 dx (pe Dx). 
K 
By (3), T is one-to-one on Bg, hence on Dy. Consequently, 


TX**: Dy —+QDx is one-to-one. A functional A, can therefore be 
defined on the range Y of T’*' by setting 


(7) A,T™*'p=Ad (PE Dy), 
and (6) shows that 
(8) WIS | vCal dx (y € Y). 
K 
The Hahn-Banach theorem therefore extends A, to a bounded linear 


functional on L’(K). In other words, there is a bounded Borel function 
g on K such that 


(9) Ag =A,T**'$ = | aooxr "oxy dx ( € Dy). 


Define g(x) = 0 outside K and put 


ay foy= | | aaddeesdr, ERD 


Then fis continuous, and n integrations by parts show that (9) gives 


(11) Ag = (— 1)" | foor"*9y09 dx  — (p € Dx). 


This is (1), with « =(N + 2,..., N + 2), except for a possible change 
in sign. //// 


When A has compact support, the local result just proved can be 


turned into a global one: 


6.27 Theorem Suppose K is compact, V and Q are open in R", and 
K <V <Q. Suppose also that A € 9'(Q), that K is the support of A, and that 
A has order N. Then there exist finitely many continuous functions f, in Q 
(one for each multi-index B with B; < N + 2 fori = 1, ..., n) with supports in 
V, such that 


(1) 


A=) D*f,. 
B 
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These derivatives are, of course, to be understood in the distribution 
sense: (1) means that 


(2) Ag =) (- 1)" | soo py(x)dx [pe AQ)]. 
B Q 


PROOF. Choose an open set W with compact closure W, such that 
K <W and Wc V. Apply Theorem 6.26 with Win place of K. Put 
a=(N+2,..., N + 2). The proof of Theorem 6.26 shows that there 
is a continuous function fin Q such that 


(3) Ag = (— 1)" [ys (x)(D*p)(x) dx — [b € B(W)]. 


We may multiply f by a continuous function which is 1 on W and 
whose support lies in V, without disturbing (3). 

Fix Ww € Z(Q), with support in W, such that wy = 1 on some open 
set containing K. Then (3) implies, for every ¢ € D(Q), that 


Ag = Awe) = (— 1)" [yr D*(W) 


=(-1!" | $Y egg DY PYDId. 
Q BK<a 
This 1s (2), with 
fp =(-1)* Pleas f: DTP (B <a). //I/ 


Our next theorem describes the global structure of distributions. 


6.28 Theorem Suppose A € 9’(Q). There exist continuous functions g, in 
Q), one for each multi-index «a, such that 


(a) each compact K < Q intersects the supports of only finitely many g,, and 
(b) A=) D%Q,. 


If A has finite order, then the functions g, can be chosen so that only 
finitely many are different from 0. 


PROOF. There are compact cubes Q; and open sets V; (i = 1, 2, 3, ...) 
such that 0, < V; <Q, Q is the union of the Q;, and no compact 
subset of Q intersects infinitely many V,. There exist @; € A(V,) such 
that ¢; = 1 on Q;. Use this sequence {¢;} to construct a partition of 
unity {w;}, as in Theorem 6.20; each y; has its support in V;,. 
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Theorem 6.27 applies to each w;A. It shows that there are 
finitely many continuous functions f; , with supports in V;, such that 


(1) WA =), Df, 4- 
Define 
(2) In = LA a: 


These sums are locally finite, in the sense that each compact 
K c Q intersects the supports of only finitely many /; ,. It follows that 
each g, is continuous in Q and that (a) holds. 

Since ¢ = >. Wi, ¢, for every d € BQ), we have A =) WA, and 
therefore (1) and (2) give (b). 

The final assertion follows from Theorem 6.27. ITH] 


Convolutions 


Starting from convolutions of two functions, we shall now define the con- 
volution of a distribution and a test function and then (under certain 
conditions) the convolution of two distributions. These are important in the 
applications of Fourier transforms to differential equations. A characteristic 
property of convolutions is that they commute with translations and with 
differentiations (Theorems 6.30, 6.33, 6.37). Also, differentiations may be 
regarded as convolutions with derivatives of the Dirac measure (Theorem 
6.37). 

It will be convenient to make a small change in notation and to use 
the letters u, v,... for distributions as well as for functions. 


6.29 Definitions In the rest of this chapter, we shall write Z and 9’ in 
place of ZR") and F(R"). If uis a function in R", and x € R",t,u and wu are 
the functions defined by 


(1) (t,uXy)=uly—x), wy)=u(—y) (ye R’"). 
Note that 
(2) (t, u(y) = u(y — x) = u(x — y). 


If u and v are complex functions in R", their convolution u * v is 
defined by 


(3) (u * v(x) = | 


u(y)o(x — y) dy, 
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provided that the integral exists for all (or at least for almost all) x € R", in 
the Lebesgue sense. Because of (2), 


(4) (u * v)(x) = { u( y(t, O)(y) dy. 


This makes it natural to define 
(5) (u« p(x) =u(t,¢) (UEeTD,pED, xe R’), 


for if u is a locally integrable function, (5) agrees with (4). Note that u * @ 1s 
a function. 

The relation { (t,u):-v=Ju-(t_,v), valid for functions u and 1, 
makes it natural to define the translate t, u of ue D’ by 


(6) (t,uo) = u(t_.@) (PED, xe R’). 


Then, for each x € R", t, u € G’; we leave the verification of the appropriate 
continuity requirement as an exercise. 


6.30 Theorem Supposeuce 9',d6€ D,W Ee QB. Then 
(a) t,(u* d) =(t,u) * d =u * (t, h) forallx € R'; 
(b) ux deC® and 

D%(u * d) = (D*u) * @ = u * (D*¢) 


for every multi-index a; 


(c) ux(P* p)=(u* o)* YW. 


PROOF. For any y € R’, 
(c,(u * b)\(y) = (u * by — x) = ul(ty_. 9), 
(cu) * Py) = (t,u\(t, O) = u(ty_.), 
(u * (t, 6) y) = u(t,(t, 6”) = ult, x), 
which gives (a); the relations 
Ceo Toe and (t,o)” =t_,@ 


were used. In the sequel, purely formal calculations such as the pre- 
ceding ones will sometimes be omitted. 
If u is applied to both sides of the identity 


(1) t,((D*p)”) = (— 1)!*'D%(,. 9) 
one obtains part of (b), namely, 


(u « (D*d)\(x) = (D*u) * $)(x). 
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To prove the rest of (b), let e be a unit vector in R", and put 


(2) N=0 (toe) (rr > 0). 
Then (a) gives 
(3) n,(u * p) = u * (1, ®). 


Asr—0,7,¢@ — D, ¢ in J, where D, denotes the directional derivative 
in the direction e. Hence 


t((n, )”) > t(D. o)” in Q, 


for each x € R", so that 


(4) lim (u * (n, &)\(x) = (u * (D. &))(x). 
By (3) and (4) we have 
(5) D{u * o) = u * (D. >), 


and iteration of (5) gives (b). 
To prove (c), we begin with the identity 


(6) (P+ W)"(D) = | Y(s)(z, P(t) ds. 


Let K, and K, be the supports of ¢ and . Put K = K, + K,. Then 
s > Ws)t, d 


is a continuous mapping of R” into Z,, which is 0 outside K,. There- 
fore (6) may be written as a ,-valued integral, namely, 


(7) (p * py)’ = | Myr ds, 
and now Theorem 3.27 shows that 
(u * (p * WO) = ud * W)’) 
= | Hout y) ds = | W(—si(u * bs) ds, 


or 
(8) (u * (pb * W))(0) = (u * G) * WO). 


To obtain (8) with x in place of 0, apply (8) to t_, w in place of 
w, and appeal to (a). This proves (c). //// 
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6.31 Definition The term approximate identity on R" will denote a 
sequence of functions h; of the form 


hx) = j"h(jx) Che i 233 ee): 
where h € A(R"), h = 0, and Jpn h(x) dx = 1. 


6.32 Theorem Suppose {h;} is an approximate identity on R", 6 € D, and 
ué Y. Then 


(a) lmgd*«h,;= din, 


jJ7 oo 
(b) limusxh,=uind’. 


jo 


Note that (b) implies that every distribution is a limit, in the topology 
of 9’, of a sequence of infinitely differentiable functions. 


PROOF. It is a trivial exercise to check that f* h;-f uniformly on 
compact sets, if f is any continuous function on R”. Applying this to 
D*@ in place of f, we see that D*(¢ * h,) > D*@ uniformly. Also, the 
supports of all @ « h; lie in some compact set, since the supports of 
the h, shrink to {0}. This gives (a). 

Next, (a) and statement (c) of Theorem 6.30 give (b), because 


u() = (u * o)(0) = lim (u « (h; * $))(0) 


= lim ((u * hj) * $)(0) = lim (u * hj). //// 
6.33 Theorem 
(a) Ifue QD’ and 
(1) Lp=uxd (pe), 
then L is a continuous linear mapping of Z into C® which satisfies 
(2) t,L= 1, (x € R"). 


(b) Conversely, if L is a continuous linear mapping of 2 into C(R"), and if L 
Satisfies (2), then there is a unique u € &' such that (1) holds 


Note that (b) implies that the range of L actually lies in C®. 


PROOF. (a) Since t,(u * @) = u * (t, 0), (1) implies (2). To prove that L 
is continuous, we have to show that the restriction of L to each D, is 
a continuous mapping into C®™. Since these are Frechet spaces, the 
closed graph theorem can be applied. Suppose that @; > @ in D, and 
that u « ¢,; > fin C®; we have to prove that f= u * @. 
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Fix x € R". Then t, ¢; > 1, ¢ in J, so that 


f(x) = lim (u « $)(x) = lim u(z, }) = u(t, b) = (u * $)(~). 


(b) Define u(d) = (L¢)\(0). Since ¢ > @ is a continuous operator 
on Y, and since evaluation at 0 is a continuous linear functional on C, 
u is continuous on Y. Thus u € &’. Since L satisfies (2), 


(Lox) = (t-, Lb)(0) = (Lt_, 6)(0) 
= u((t_. P)Y) = ult, b) = (u * P(x). 


The uniqueness of u is obvious, for if ue D’ and u * d = 0 for 
every @ € JY, then 


u(p) = (u * $0) = 
for every ¢ € 9; hence u = 0. /// 


6.34 Definition Suppose now that ue’ and that u has compact 
support. By Theorem 6.24, u extends then in a unique fashion to a contin- 
uous linear functional on C®. One can therefore define the convolution of u 
and any @ € C® by the same formula as before, namely, 


6.35 


(a) 


(ux p(x) =u(t,¢) (x € R”). 


Theorem Suppose ue ZY’ has compact support, and @ € C®. Then 


t,(u + @) = (tu) * b =u * (1, b) if x © R’, 


(b) ux de C® and 


Du * bd) = (D*u) * @ = u * (D%¢). 


If, in addition, W € D, then 


(c) ux we Q,and 
(4) ux(pxp)=(Ux db) * Pp =(ux Wp) * O. 


PROOF. The proofs of (a) and (b) are so similar to those given in 
Theorem 6.30 that they need not be repeated. To prove (c), let K and 
H be the supports of u and w, respectively. The support of 1, is 
x — H. Therefore 


(u * W(x) = u(t, p) = 


unless K intersects x — H, that is, unless x e K + H. The support of 
u * Ww thus lies in the compact set K + H. 

To prove (d), let W be a bounded open set that contains K, and 
choose ¢, € Z so that ¢, = d in W + H. Then (o * W)Y = (ho * W)’ 
in W, so that 


(1) (u * (pb * W)NO) = (u * (Po * WO). 
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If —se H, then t,¢ = 1,4) in W; hence u « 6 =u * $y in —H. 


This gives 

(2) ((u * p) * WO) = (U * Go) * WYO). 
Since the support of u * w lies in K + H, 

(3) ((u * w) * PO) = (Cu * W) * G(0). 


The right sides of (1) to (3) are equal, by Theorem 6.30; hence so 
are their left sides. This proves that the three convolutions in (d) are 
equal at the origin. The general case follows by translation, as at the 
end of the proof of Theorem 6.30. hey 


6.36 Definition If uc 2’, ve Y, and at least one of these two distribu- 
tions has compact support, define 


(1) Lod =u * (v * d) (pe &). 


Note that this is well defined. For if v has compact support, then 
v* geQ, and Ld € C®; if u has compact support, then again L@ € C®, 
since v * @ € C®. Also, t, L = Lt,, for all x € R”. These assertions follow 
from Theorems 6.30 and 6.35. 

The functional @—(L¢)0) is in fact a distribution. To see this, 
suppose ¢, > 0 in Y. By (a) of Theorem 6.33, v * ¢, > 0 in C®; if, in addi- 
tion, v has compact support then v « ¢; > 0 in @. It follows, in either case, 
that (L¢,\(0) > 0. 

The proof of (b) of Theorem 6.33 now shows that this distribution, 
which we shall denote by u * v, is related to L by the formula 


(2) Lo =(u * v) * (PE Y). 
In other words, u * v € Y’ is characterized by 
(3) (uxv)* d=u* (v * d) (pe QD). 


6.37 Theorem SupposeuceD',veD,we 


(a) If at least one of u, v has compact support, then u * v = v * Uu. 
(b) If S, and S, are the supports of u and v, and if at least one of these is 
compact, then 
Say] Ou te Des 


(c) If at least two of the supports S,, S,, S 


(ux v)*w=u x (v * Ww). 


>> Sw are compact, then 


(d) Ifo is the Dirac measure and « is a multi-index, then 
D*u = (D%0) * u. 


In particular, u = 6 * U. 
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(e) If at least one of the sets S,, S, is compact, then 


D*(u * v) = (D*u) * v = u * (Dv) 


for every multi-index «a. 


Note: The associative law (c) depends strongly on the stated hypoth- 
eses; see Exercise 24. 


PROOF. (a) Pick ¢ € JY, W € @. Since convolution of functions is com- 
mutative, (c) of Theorem 6.30 implies that 


(us v) (pW) =usx (pep) 
=us((v*d)*W=us(y*(v* dO). 


If S, 1s compact, apply (c) of Theorem 6.30 once more; if S, is 
compact, apply (d) of Theorem 6.35; in either case 


(1) (u * v) * (p * W) = (u * p)  (v * @). 
Since @ * wy = w * ¢, the same computation gives 
(2) (v x u) * (p * W) =(v * ) * (ux ). 


The two right members of (1) and (2) are convolutions of func- 
tions (one in J, one in C%); hence they are equal. Thus 


(3) ((u * v) * ob) * W =((v * u) * G) * 


Two applications of the uniqueness argument used at the end of the 
proof of Theorem 6.33 now give u * v =v * u. 


(b) Ifo@ € Y,a simple computation gives 
(4) (u * v)\(p) = u((v * G)’). 


By (a) we may assume, without loss of generality, that S,, 1s compact. 
The proof of (c) of Theorem 6.35 shows that the support of v * ¢ lies 
in S, — S,. By (4), (u * v)(p) = 0 unless S, intersects S, — S,, that is, 
unless S, intersects S, + S,. 


(c) Weconclude from (b) that both 
(u * v) * Ww and u*(v * w) 


are defined if at most one of the sets S,, S,, S,, fails to be compact. If 
od € Q, it follows directly from Definition 6.36 that 


(5)  (u* (v * Ww) * P =u ((v * W) * G) =U (UD * (W # @)). 


If S,, is compact, then 


(6) ((u * v) * w) * d =(u * v) * (W * d) =U * (V * (w * )) 
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because w * @ € &, by (c) of Theorem 6.35. Comparison of (5) and (6) 
gives (c) whenever S,, is compact. 

If S, is not compact, then S, is compact, and the preceding case, 
combined with the commutative law (a), gives 


ux*(v * w) =U * (Ww * v) = (W * v) * 
= w * (v * u) = Ww * (u * V) = (u * V) * OW. 
(qd) Ifpe GD, then 6 * d = @, because 
(5 * p)(x) = 6(t, d) = (t. G0) = $(—x) = P(x). 
Hence (c) above and (b) of Theorem 6.30 give 
(D*u) * 6 =u * D*hd =u * D6 * d) = u * (D%O) * ©. 
Finally, (e) follows from (d), (c), and (a): 


D*(u * v) = (D%6) * (u * v) = ((D%6) * u) * v = (D*u) * v 
and 
((D%5) * u) * v = (u * D*d) * v = u * ((D%6) * v) =u x Dv. //// 


Exercises 


1. 


Suppose f is a complex continuous function in R", with compact support. Prove 
that WP; f uniformly on R", for some wy € Z and for some sequence {P,} of 
polynomials. 


. Show that the metrizable topology for GQ) that was rejected in Section 6.2 1s 


not complete for any Q. 


. If E is an arbitrary closed subset of R”, show that there is an f € C“(R"”) such 


that f(x) = 0 for every x € E and f(x) > 0 for every other x € R". 


. Suppose A € Y’(Q) and Ad => 0 whenever ¢ € AQ) and ¢ = 0. Prove that A is 


then a positive measure in Q (which is finite on compact sets). 


. Prove that the numbers c,, in the Leibniz formula are 


n 


II a; ! 

C= ——_—__—__——. 

? iz1 Bi (a; — Bi! 

. (a) Suppose c,, = exp {—(m!)!}, m = 0, 1, 2,... . Does the series 
d &m(D" XO) 
m=0 


converge for every @ € C™(R)? 

(b) Let Q be open in R", suppose A; € Z’(Q), and suppose that all A; have their 
supports in some fixed compact K < ©. Prove that the sequence {A,} cannot 
converge in Y(Q) unless the orders of the A, are bounded. Hint: Use the 
Banach-Steinhaus theorem. 

(c) Can the assumption about the supports be dropped in (b)? 
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7. Let Q = (0, 0). Define 
foe) a 1 
Ao = > (D"¢) m [¢ € BQ)]. 
m= 1 


Prove that A is a distribution of infinite order in ©. Prove that A cannot be 
extended to a distribution in R; that is, there exists no Ay € YR) such that 
Ay = A in (0, 00). 

8. Characterize all distributions whose supports are finite sets. 

9. (a) Prove that a set E < YQ) is bounded if and only if 


sup {|A@|: ¢ € E} < 


for every A € JQ). 

(b) Suppose {@,} is a sequence in BQ) such that {A@,} is a bounded sequence 
of numbers, for every A € 9(Q). Prove that some subsequence of {¢,} con- 
verges, in the topology of BQ). 

(c) Suppose {A,} is a sequence in Y’(Q) such that {A,¢} is bounded, for every 
@ € AQ). Prove that some subsequence of {A,} converges in YQ) and that 
the convergence is uniform on every bounded subset of BQ). Hint: By the 
Banach-Steinhaus theorem, the restrictions of the A; to Dx are equi- 
continuous. Apply Ascoli’s theorem. 

10. Suppose { f;} is a sequence of locally integrable functions in Q (an open set in 

R") and 


lim | |f(x)| dx =0 


i>mo JK 


for every compact K <Q. Prove that then D*f; > 0 in Z’(Q), as i > oo, for every 
multi-index a. 

11. Suppose Q is open in R’, and { f,} is a sequence of harmonic functions in Q that 
converges in the distribution sense to some A € 9'(Q); explicitly, the assumption 
is that 


Ag = lim | f«x)@(x) dx = [d € BQ). 
imo JQ 

Prove then that { f;} converges uniformly on every compact subset of Q and that 
A is a harmonic function. Hint: If f is harmonic, f(x) is the average of f over 
small circles centered at x. 

12. Recall that 6 (the Dirac measure) is the distribution defined by 6(¢) = (0), for 
go € HR). For which f € C%(R) is it true that fd’ = 0? Answer the same question 
for f6”. Conclude that a function fe C~(R) may vanish on the support of a 
distribution A € 9’(R) although fA + 0. 

13. If 6 € BQ) and A € Y'(Q), does either of the statements 


oA =0, Ad=0 


imply the other? 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 
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Suppose K is the closed unit ball in R", A € 9’(R") has its support in K, and 
f € C”(R") vanishes on K. Prove that fA = 0. Find other sets K for which this is 
true. (Compare with Exercise 12.) 


Suppose K c V <Q, K is compact, V and Q are open in R", A € 9’(Q) has its 
support in K, and {¢,} < FQ) satisfies 
(a) lim sup | (D*pXx) | =0 
itxo LxeVv 
for every multi-index «. Prove that then 
lim A(@¢,) = 0. 
to 


The preceding statement becomes false if V is replaced by K in the hypothesis 
(a). Show this by means of the following example, in which Q = R. Choose 
C, >C, >: >0,such that )) c; < «; define 


Ag = pa (P(c))— 60) (PE AR)); 


and consider functions ¢; € A(R) such that d{x) = 0 if x <c¢;4,, d{x) = 1/i if 
C; < x < c,. Show also that this A is a distribution of order 1. 

However, for certain K, V can be replaced by K in the hypothesis (a) of 
Exercise 15. Show that this is so when K is the closed unit ball of R”. Find other 
sets K for which this is true. 

If A € 2(R) has order N, show that A = D'*f, for some continuous function f. 
If A = 6, what are the possibilities for f ?. 

Express 6 € Y’(R’) in the form given by Theorem 6.27, as explicitly as you can. 
Suppose A € 9’(Q), @ € AQ), and (D*o\x) = 0 for every x in the support of A 
and for every multi-index a. Prove that Ad = 0. Suggestion: Do it first for dis- 
tributions with compact support, by the method used in Theorem 6.25. 

Prove that every continuous linear functional on C%(Q) is of the form f— Af, 
where A is a distribution with compact support in Q; this is a converse to (d) of 
Theorem 6.24. 

Let C%(T) be the space of all infinitely differentiable complex functions on the 
unit circle T in ¢. One may regard C™(T) as the subspace of C“(R) consisting of 
those functions that have period 22. Suppose 


f= di 4,2" 
n=0 


converges in the open unit disc U in ¢. Prove that each of the following three 
properties of fimplies the other two: 
(a) There exist p < o and y < o such that 


|a,|<y-n? (n= 1523, 253): 
(b) There exist p < o0 and y < o such that 
Iflsy-G—-I|z))? (eV). 
(c) lim,.., Jt, f(re®)(e®) dO exists (as a complex number) for every @ € C*(T). 
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22. For u € Y’(R), show that 


u—T,U 


+Du in 9(R), 
x 


as x — 0. (The derivative of u may thus still be regarded as a limit of quotients.) 
23. Suppose { f;} is a sequence of locally integrable functions in R", such that 


lim (f, * d(x) 


exists, for each @ € D(R") and each x € R". Prove that then {D*(f; + ¢)} con- 
verges uniformly on compact sets, for each multi-index «. 
24. Let H be the Heaviside function on R, defined by 


H(x) = 1 ifx > 0, 
cas Fa eee 


and let 6 be the Dirac measure. 

(a) Show that (H * ¢Xx) = [*,, G(s) ds, if 6 € A(R). 

(b) Show that 0’ * H = 6. 

(c) Show that 1 * 6’ = 0. (Here 1 denotes the locally integrable function whose 
value is 1 at every point and which is thought of as a distribution.) 

(d) It follows that the associative law fails: 


1*(0'* H)=1+6=1, 
but 
(1 +«06)*H=0+H=0. 


25. Here is another characterization of convolutions analogous to Theorem 6.33. 
Suppose L is a continuous linear mapping of Z into C® which commutes with 
every D*, that 1s, 


(a) LD*@ = D*L@ (pe QD). 
Then there is a u € @ such that 
Lod =u * ¢. 
Suggestion: Fix @ € J, put 
h(x) = (t_, Lt, OXO) = (Lt, Px) = (x € R"), 


let D, be the directional derivative used in the proof of Theorem 6.30, and show 
that 


(D. hXx) = (D, Lt, Xx) — (Lt, D. OX), 


which is 0 if (a) holds. Thus h(x) = h(0), which implies that t, L = Lt,. 
Can the assumption that the range of L is in C® be weakened? 
26. If fe L-((— «, —4) u (6, «)) for every 5 > 0, define its principal value integral to 


be 
PV {" f(x) dx = lim (| + [)409 dx, 
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if the limit exists. For @ € A(R), put 


Ad = | (x) log |x| dx. 
Show that 


= dx 
A’oé = PV p(x) —, 
a x 
p(x) — P(0) co 


x? 


A" = -Pv | 


27. Find all distributions u € Y’(R") that satisfy at least one of the following two 
conditions: 
(a) t,.u = u for every x € R", 
(b) D*u = 0 for every a with |a| = 1. 


CHAPTER 


FOURIER 
TRANSFORMS 


Basic Properties 


7.1 Notations (a) The normalized Lebesgue measure on R" is the 
measure m, defined by 


dm,(x) = (2n)~"/? dx. 


The factor (2x)~"/? simplifies the appearance of the inversion theorem 7.7 
and the Plancherel theorem 7.9. The usual Lebesgue spaces L’?, or L?(R"), 
will be normed by means of m,: 


1/p 
is,=4| If? im, (1 < p < o). 


It is also convenient to redefine the convolution of two functions on R” by 


(f * g\(x) = | f(x — y)g(y) am,(y) 


whenever the integral exists. 
(b) For each t € R", the character e, is the function defined by 


e(x) =e * = exp fityxy too + t,x} (x RY). 


182 
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Each e, satisfies the functional equation 


e(x of y) a e(x)e,(y ). 


Thus e, is a homomorphism of the additive group R” into the multiplicative 
group of the complex numbers of absolute value 1. 


(c) The Fourier transform of a function fe L'(R") is the function f 
defined by 


fio =| fe_,dm, — (t € R’). 


The term “Fourier transform” is often also used for the mapping that takes 
fto f. Note that 


f(t) =(f * eX). 


(d) If « is a multi-index, then 


PRO 2 ee oe eee 
Pe ONG) NG oe) 


The use of D, in place of D* simplifies some of the formalism. Note that 


— 72 
D,e, = te, 


where, as before, t* = tf! --- t". If P is a polynomial of n variables, with 
complex coefficients, say 


PE) = cyt = Cy EH + Es, 
the differential operators P(D) and P(— D) are defined by 
P(D)=Sc,D,, — P(—D) = ¥ (= 1)!"le, Dy. 
It follows that 
P(D)e, = P(t)e, (t € R"). 
(e) The translation operators t, are defined, as before, by 


(t~ SM) =fly—x) — (% y € R"). 


7.2 Theorem Suppose f, g € L'(R"), x € R". Then 


(a) (t, f)* = e_xf; 

(b) (e,f)* = TSS 

(c) (fg) =f9. 

(d) IfA>Oand h(x) = f(x/A), then h(t) = A"f (Ar). 
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PROOF. It follows from the definitions that 


(tT, Nro= | I) t= \r Tye a= |r e_(xje_, = e_,(f (0) 


and 


(e, f)(t) = [ese — | feu» mn (t, f (0). 


An application of Fubini’s theorem gives (c); (d) is obtained by a 
linear change of variables in the definition of f. //// 


7.3 Rapidly decreasing functions This name is sometimes given to 
those f € C~(R") for which 


(1) a sup (1 +|x|*)"|(D, f)(x)| < 00 


for N = 0, 1, 2, .... (Recall that |x|? = }° x?.) In other words, the require- 
ment is that P - D, fis a bounded function on R", for every polynomial P 
and for every multi-index «. Since this is true with (1 + |x|*)*P(x) in place 
of P(x), it follows that every P - D, flies in '(R’"). 

These functions form a vector space, denoted by ,, in which the 
countable collection of norms (1) defines a locally convex topology, as 
described in Theorem 1.37. 

It is clear that A(R") c F,,. 


7.4 Theorem 


(a) &, is a Fréchet space. 
(b) If P is a polynomial, ge ¥,, and « is a multi-index, then each of the 
three mappings 


f>Pf fog, fod 


is a continuous linear mapping of &,, into S,,. 
(c) Iffe ¥,, and P is a polynomial, then 


(P(D)f)" = Pf and (Pf) = P(—D)f. 


(d) The Fourier transform is a continuous linear mapping of /,, into S,,. 


[ Part (d) will be strengthened in Theorem 7.7. ] 


PROOF. (a) Suppose { f;} is a Cauchy sequence in /,. For every pair of 
multi-indices « and B the functions x’D*f,(x) converge then (uniformly 


71.5 
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on R") to a bounded function g,,, as i oo. It follows that 


JaplX) = x?D*goo(x) 
and hence that f; > gp, in Y,. Thus /, is complete. 


(b) If fe F,, it is obvious that D, fe ¥,, and the Leibniz 
formula implies that Pf and gf are also in ¥,. The continuity of the 
three mappings is now an easy consequence of the closed graph 
theorem. 


(c) Iffe X,,s01s P(D)f, by (b), and 
(P(D)f) * e, =f * P(D)e, =f * P(tje, = P()LS * e,]. 


Evaluation of these functions at the origin of R,, gives the first 
part of (c), namely, 


(P(D) f)"*(t) = POS (0. 
Ift =(t,,...,t,)andt’ = (t, + 6, ae t,), € # O, then 
f(t) —f() 
1€ 


-|, Lf ea. 


The dominated convergence theorem can be applied, since x, fe L’, 
and yields 


ia f0- [ xf (oe * dm,(x) 


Ps 


This is the case P(x) = x, of the second part of (c); the general case 
follows by iteration. 

(d) Suppose fe YS, and g(x) = (—1)'*!x*f(x). Then g € %,; now 
(c) implies that g = D wf and P- D, f= P- g =(P(D)g)’, which is a 
bounded function, since P(D)g € L1(R’). This proves that fe Y,. If 
f.ofin F&,, then f,>f in L(R"). Therefore f(t) > f(t) for all t € R". 
That ff is a continuous mapping of Y, into #, follows now from 
the closed graph theorem. ids 


Theorem [ff ce L'(R"), then fe C,(R”), and || f ||, < fl. 


Here C,(R") is the supremum-normed Banach space of all complex 


continuous functions on R” that vanish at infinity. 


PROOF. Since | e,(x)| = 1, it 1s clear that 


(1) IfOl<ifl, (fel, te R%). 


Since A(R")< ¥,, F, is dense in L'(R"). To each fe L'(R") corre- 
spond functions f,¢ “, such that ||f—fil, 20. Since feY, 
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C,(R") and since (1) implies that f, +f uniformly on R", the proof is 
complete. //// 


The following lemma will be used in the proof of the inversion 
theorem. It depends on the particular normalization that was chosen 
for m,,. 


7.6 Lemma _ If @, is defined on R" by 


(1) b,(x) = exp {—4|x|?} 
then d, € S,, 0, = >,, and 


(2) ,(0) = | , dm, 
Ra 
PROOF. It is clear that ¢, € /,. Since ¢, satisfies the differential equa- 
tion 
(3) y+xy=0, 


a short computation, or an appeal to (c) of Theorem 7.4, shows that 
@, also satisfies (3). Hence ¢,/¢, is a constant. Since ¢,(0) = 1 and 


00 


exp {—4x*} dx = 1, 


(0) -| op, dm, = (2n)~ */? | 


we conclude that ¢, = ¢,. Next, 


(4) PrlX) = Py(X1) °° Pi(X,) (x € R") 

so that 

(5) Plt) = Pilts)-“* bit.) (te R”). 

It follows that ¢, = @, for all n. Since ,(0) = | ¢, am,, by definition, 
and since ¢, = ¢,, we obtain (2). //// 


7.7 The inversion theorem 
(a) Ifge F,, then 
(1) g(x) = | ge, dm, (xe R’"). 
Ra 


(b) The Fourier transform is a continuous, linear, one-to-one mapping of &,, 
onto #,,, of period 4, whose inverse is also continuous. 
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(c) Iff e UR"), fe UR”, and 


(2) fo(x) = | fe,dm, (x ER"), 


then f(x) = fo(x) for almost every x € R". 


PROOF. If f and g are in L/(R"), Fubini’s theorem can be applied to the 
double integral 


{ : f(x)a(y)e~'*"” dm, (x) dm,(y) 


to yield the identity 


(3) | fg dm, = { fg dm,,. 


To prove part (a), take ge S¥,, GE F,, f(x) = &(x/A), where 
A > 0. By (d) of Theorem 7.2, (3) becomes 


g(t)a"d(at) dm,(t) = | $( ~ \g(y) dm,(y), 
Rn Rn A 


Or 


(4 { (2600 dm,(t) = | o(2}a0n dm,(y). 


As A> 0, g(t/A) > g(0) and ¢(y/A) > (0), boundedly, so that the 
dominated convergence theorem can be applied to the two integrals in 
(4). The result is 


(5) g(9) | ¢ dm, = $(0) | gdm, (9,0€P,). 


If we specialize @ to be the function @¢, of Lemma 7.6, (5) gives the 
case x = 0 of the inversion formula (1). The general case follows from 
this, since (a) of Theorem 7.2 yields 


a0) = (©-.930) = | (t_.g)” im, = | ge, dm, . 


This completes part (a). 

To prove part (b), we introduce the temporary notation Dg = g. 
The inversion formula (1) shows that ® is one-to-one on Y,, since 
g = 0 obviously implies g = 0. It also shows that 


(6) Og=g 
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where, we recall, g(x) = g(— x), and hence that ®*g = g. It follows that 
® maps ¥, onto £,. The continuity of ® has already been proved in 
Theorem 7.4. To prove the continuity of ®~‘, one can now either refer 
to the open mapping theorem or to the fact that 0! = ®°. 

To prove (c), we return to the identity (3), with g e Y,. Insert 
the inversion formula (1) into (3) and use Fubini’s theorem, to obtain 


(7) | fod dm, = | fgdm, (geF,). 
Rn Rn 
By (b), the functions g cover all of Y,. Since BR") < F¥,, (7) implies 
that 
(8) | (fo —f) dm, = 0 
Rn 


for every @ € A(R"), hence (by a uniform approximation described in 
Exercise | of Chapter 6) for every continuous ¢@ with compact 
support. It follows that fp — f= 0 ae. /T// 


7.8 Theorem Iffe ¥, andgeY,, then 


(a) f*geF,,and 
(b) (fg) =f x G. 


PROOF. By (c) of Theorem 7.2, (f * g)* =f, or 


(1) O(f * g) = Of: Og, 


in the notation used in the proof of (b) of Theorem 7.7. With f and g in 
place of f and g, (1) becomes 


(2) Of * §) = Of 07g = fg = (fg)’ = O(f9). 

Now apply ®~' to both sides of (2) to obtain (b). Note that fg € Y,,; 
hence (b) implies that f* ge Y,, and this gives (a), since the Fourier 
transform maps Y, onto &,. //// 


7.9 The Plancherel theorem There is a linear isometry V of I?(R") onto 
I?(R") which is uniquely determined by the requirement that 


Wf=f for every fe S,. 


Observe that the equality ¥f = f extends from Y, to’ 7 L, since F, 
is dense in I? as well as in L’. This gives consistency: The domain of ¥ is I’, 
f was defined in Section 7.1 for all fe L’, and ¥f =f whenever both defini- 
tions are applicable. Thus extends the Fourier transform from L’ 7 LI? to 
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L’. This extension is still called the Fourier transform (sometimes the 
Fourier-Plancherel transform), and the notation f will continue to be used in 
place of ‘Pf, for any fe L°(R"). 


PROOF. If fand g are in Y,,, the inversion theorem yields 


| fg dm, = | g(x) dm,(x) { f(te*'* dm,(t) 


= | f(t) dm,(t) | g(xje*"' dm,(x). 


The last inner integral is the complex conjugate of g(t). We thus get 
the Parseval formula 


(1) | faim, = | fgdm, (f£g9¢ef,). 
Ra Ra 
If g = f, (1) specializes to 


(2) Wfll2=Wle Ge F,). 


Note that ,, is dense in I?(R"), for the same reason that Y,, is 
dense in L'(R"). Thus (2) shows that f/f is an isometry (relative to 
the I?-metric) of the dense subspace /, of I?(R") onto /,. (The map- 
ping is onto by the inversion theorem.) It follows, by elementary 
metric space arguments, that ff has a unique continuous extension 
VY: [?(R") > 2(R") and that this ¥ is a linear isometry onto L?(R’). 
Some details of this are given in Exercise 13. //// 


It should be noted that the Parseval formula (1) remains true for arbi- 
trary fand g in [7(R’). 

That the Fourier transform is an I?-isometry is one of the most 
important features of the whole subject. 


Tempered Distributions 


Before we define these, we establish the following relation between ¥,, and 
G(R"). 


7.10 Theorem 


(a) QDR") is dense in f,. 
(b) The identity mapping of Z(R") into #,, is continuous. 


These statements refer, of course, to the usual topologies of ZR") and 
SF,, as defined in Sections 6.3 and 7.3. 
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PROOF. (a) Choose fe %,, W € D(R") so that wy = 1 on the unit ball of 
R", and put 


(1) Fx) =f(x)wrx) — (x € R", r > 0). 
Then f, € DR"). If P is a polynomial and « is a multi-index, then 


P(x)D(f — fx) = P(x) Y cag(D* Pf xc)r!*'D°L1 — wir). 
Bsa 


Our choice of y shows that D*[1 — wW](rx) = 0 for every multi-index B 
when |x|-<1/r. Since fe Y,, we have P- D* ®feC,(R") for all 
B < «. It follows that the above sum tends to 0, uniformly on R”, when 
r—Q. Thus f, > fin S,,, and (a) is proved. 

(b) If K is a compact set in R", the topology induced on Dy, by 
SF, 1s Clearly the same as its usual one (as defined in Section 1.46), 
since each (1 +|x|?)* is bounded on K. The identity mapping of D, 
into Y, is therefore continuous (actually, a homeomorphism), and 
now (b) follows from Theorem 6.6. //// 


7.11 Definition If i: A(R") FY, is the identity mapping, if L is a con- 
tinuous linear functional on Y,, and if 


(1) up= Loi 


then the continuity of i (Theorem 7.10) shows that u,; € Z’(R"); the dense- 
ness of Z(R") in #, shows that two distinct L’s cannot give rise to the same 
u. Thus (1) describes a vector space isomorphism between the dual space %, 
of Y,, on the one hand, and a certain space of distribution on the other. 
The distributions that arise in this way are called tempered: 

The tempered distributions are precisely those u € Q'(R") that have con- 
tinuous extensions to /,,. 

In view of the preceding remarks, it is customary and natural to iden- 
tify u, with L. The tempered distributions on R" are then precisely the 
members of &,,. 

The following examples will explain the use of the word “tempered” 
in this connection; it indicates a growth restriction at infinity. (See also 
Exercise 3.) 


7.12 Examples (a) Every distribution with compact support is tempered. 
Suppose K is the compact support of some u € Y’(R"), fix wy € DR") so that 
yw = 1 in some open set containing K, and define 


(1) uf)=uvf) (fe F,). 


If f, > 0in Y,, then all D*f, +0 uniformly on R", hence all D*(wf;) > 0 uni- 
formly on R", so that wf; > 0 in DR"). It follows that u is continuous on 
SF, Since ud) = u(d) for @d € D(R"), vis an extension of u. 
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(b) Suppose p is a positive Borel measure on R" such that 


(2) | (1 + |x|7)~* dulx) < 


for some positive integer k. Then p is a tempered distribution. The assertion 
is, More explicitly, that the formula 


3) ee | fdu 


defines a continuous linear functional on Y,. 
To see this, suppose f; ~ 0 in Y,. Then 


(4) é, = sup (1 + |x]*)*| f(x)| > 0. 


Since | Af;| is at most ¢, times the integral in (2), Af, 0. This proves the 
continuity of A. 


(c) Suppose 1 < p< ao, N >0O, and g is a measurable function on R" 
such that 


(5) { 1 + |x|7)~ g(x) |? dm,(x) = C < o. 


Then g is a tempered distribution. 
As in (b), define 


(6) Af = | fg dm,,. 


Assume first that p > 1; let q be the conjugate exponent. Then Holder’s 
inequality gives 


1/q 
(7) |Af| < crn | [1 + | |7)"F (x) |? dm) 


< C’/?B*4 sup |(1 + | x|*)“f(x) |, 


xeRa 


where M is taken so large that 
| (1 + |x |7)%~™* dm,(x) = B < oo. 
Ra 


The inequality (7) proves that A is continuous on &,. The case p = 1 is 
even easier. 

(d) It follows from (c) that every g € [?(R") (1 < p < o) is a tempered 
distribution. So is every polynomial and, more generally, every measurable 
function whose absolute value is majorized by some polynomial. 
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7.13 Theorem If « is a multi-index, P is a polynomial,g € £,, and uisa 
tempered distribution, then the distributions D*u, Pu, and gu are also tem- 
pered. 


PROOF. This follows directly from (b) of Theorem 7.4 and the defini- 
tions 


(D*u)(f) = (—1)!""u(D*f), 
(Pu)(f) = u(Pf), 
(gu\(f) = u(gf). /[I// 


7.14 Definition For ue &,,, define 
(1) ip) = ud) (bE S,). 


Since ¢ > ¢ is a continuous linear mapping of ¥, into ¥, [(d) of Theorem 
7.4], and since u is continuous on Y,, it follows thatie SF,. 

We have thus associated with each tempered distribution u its Fourier 
transform t, which is again a tempered distribution. Our next theorem will 
show that the formal properties of Fourier transforms of rapidly decreasing 
functions are preserved in the larger setting of tempered distributions. 

But first there arises a consistency question that ought to be settled. If 
fe L'(R"), then f may also be regarded as a tempered distribution, say u,, so 
that two definitions of the Fourier transform are available, namely, (c) of 
Section 7.1 and Definition 7.14. The question is whether they agree, 1.e., 
whether the distribution (u,)* corresponds to the function f. The answer is 
affirmative, because 


(ub) =u) = | jie | jeatuine 


for every ge &,. The third of these equalities is the identity (3) of Section 
7.7; the others are definitions. 

Since [?7(R")< £', the same question arises for the Fourier- 
Plancherel transform. The answer is again affirmative, by the same proof, 
since the identity | f@ = { f@ persists for fe 27(R") and d € F,. 


7.15 Theorem 


(a) The Fourier transform is a continuous, linear, one-to-one mapping of /,, 
onto S,, of period 4, whose inverse is also continuous. 


(b) Ifue ¥, and P is a polynomial, then 
(P(D)u)* = Pa and (Pu)* = P(—D)i. 
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Note that these are the analogues of (b) of Theorem 7.7 and (c) of 
Theorem 7.4. The topology to which (a) refers is the weak*-topology that 
Ff, induces on Y,. Note also that the differential operators P(D) and 
P(—D) are defined in terms of D,, not D*; see (d) of Section 7.1. 


PROOF. Let W be a neighborhood of 0 in #,. Then there exist func- 
tions ¢,,..., d, € F, such that 


(1) {fue F,:|u(d)|<1 for 1<i<k}cw. 
Define 
(2) V={ue Ff: |uld)|<1 for 1<i<kt. 


Then V is a neighborhood of 0 in Y/,, and since 
(3) up)=uld) (PES,,uE S,), 


we see that i € W whenever u € V. This proves the continuity of ®, 
where we write ®u = i. Since ® has period 4 on Y,,, (3) shows that ® 
has period 4 on “, that is, that ®*u = u for every ue &,. Hence ® is 
one-to-one and onto, and since ® =! = ®°, @~! is continuous. 

Statement (b) follows from (c) of Theorem 7.4 and from Theorem 
7.13, by the computations 


(P(D)u)(p) = (P(D)u(p) = u(P(—D)¢) 
= u((P)*) = u(Pd) = (Puy) 


and 
(P(— D)ii\(p) = X(P(D)) = u((P(D)?)”) 
= u(Po) = (Puy) = (Pu)"(¢), 
where ¢ is an arbitrary function in F,. //// 


7.16 Examples We saw in (d) of Section 7.12 that polynomials are tem- 
pered distributions. Their Fourier transforms are easily computed. We 
begin with the polynomial 1; regarded as a distribution on R", 1 acts on test 
functions @ by the formula 


(1) 1($) = | 1 dm, = |  dm,. 


Hence 


(2) i(¢) = 1(6) = { d dm, = $(0) = (9), 
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where 6 is the Dirac measure on R". Likewise, 


(3) 5(¢) = 5(¢) = $(0) = [ p dm, = 1(). 


Thus (2) and (3) give the results 
(4) i=6 and 6 =1. 


If P is now an arbitrary polynomial on R", and if we apply (6) of 
Theorem 7.15 with u = 6 and with u = 1, the results in (4) show that 


(5) (P(D)5)* =P and P= P(—D)6. 


The two formulas in (4) [as well as those in (5)] can also be derived 
from each other by the inversion theorem, which may be stated for tem- 
pered distributions in the following way: 

Ifue F.,, then (i)* = it, where i is defined by 


(6) i(d)=uld) (bE Y,). 
The proof is trivial, since (f)* = ¢, by (a) of Theorem 7.7: 
(a)*(h) = t() = u((G)*) = u(P) = H(¢). 

Note that 6 = 6. 

If we combine (5) with Theorem 6.25, we find that a distribution is the 
Fourier transform of a polynomial if and only if its support is the origin (or 
the empty set). 

The following lemma will be used in the proof of Theorem 7.19. Its 


analogue, with ZR") in place of ,, is much easier and was used without 
comment in the proof of Theorem 6.30. 


7.17 Lemma I[fw=(1,0,...,0)6 RR’, if¢ € F,, and if 


(1) (x) = EA OO) ee Re > 0) 


then o, > 0¢/0x, in the topology of S,, as & > 0. 


PROOF. The conclusion can be obtained by showing that the Fourier 
transform of ¢, — 0¢/0x, tends to 0 in Y,, that is, by showing that 


(2) v.d@-0inY,, ase—0, 
where 


exp (iey,) — 1 


(3) w.(y) = iy, (yeR",c>0). 
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If P is a polynomial and @ is a multi-index, then 


(4) P- D*W.6) = ¥ cap P + (D*-*) « (DP W,). 
B<a 
A simple computation shows that 
Ey if | B| = 0, 
(5) Dw l<yjelyil if |BI=1, 


ell-1 if | BL > 1. 


The left side of (4) tends therefore to 0, uniformly on R", as e > 0. The 
definition of the topology of ¥, (Section 7.3) shows now that (2) 
holds. ate 


7.18 Definition Ifuec Sand de &,, then 
(u + bx) = u(t, g) (xe R") 


Note that this is well defined, since t, 6 € Y, for every x € R". 


7.19 Theorem Suppose ¢ € ¥, and u is a tempered distribution. Then 


(a) ux de C%(R"), and 
D*(u * d) = (D*u) * d = u * (D*¢) 
for every multi-index a, 
(b) u* has polynomial growth, hence is a tempered distribution, 
(c) (ux $)* = ga, 
(dZ) (ux o)* p =u * ( * ), for everyWeF,, 
(ce) a+ p =(pu)’. 


PROOF. The second equality in (a) is proved exactly as in Theorem 
6.30, since convolution obviously still commutes with translations. 
This also shows that 


(1) (a= \u «p)=u% (===) 


Lemma 7.17 now gives D%(u * @) = u * (D%¢) if « = (1, 0, ..., 0). Iter- 
ation of this special case gives (a). 

Let py(f) denote the norm (1) of Section 7.3, for fe Y,. The 
inequality 


(2) L+|x+yP?<21+ |xll+lyl?) Gye R") 
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shows that 


(3) Putz f) < 2% +|x)?)*pw(f) = (we RY fe F,). 


Since u is a continuous linear functional on ¥, and since the norms 
Py determine the topology of ,, there is an N and a C < © such 


that 

(4) luf)|< Cpt) (fe F,); 

see Chapter 1, Exercise 8. By (3) and (4), 

(5) J(u * P)(x)| = |ult, | < 2%Cpa(PN1 + 1x17)", 
which proves (b). 


Thus u * @ has a Fourier transform, in %,. If wy € G(R"), with 
support K, then 


(u « o)*(b) = (u * OW) = | (u * Px)(—x) dm,(x) 


{ ulw(—x)t,.6] dm,(x) = u [ W(—x)t, & dm,(x) 
u((p * W)’) = a(d * W)*) = ad) 


so that 
(6) (u « b)*(W) = (Pay). 


In the preceding calculation, Theorem 3.27 was applied to an 
SF ,-valued integral, when u was moved across the integral sign. So far, 
(6) has been proved for yw € G(R"). Since G(R") is dense in ¥,, the 
Fourier transforms of members of YR") are also dense in F,, by (b) 
of Theorem 7.7. Hence (6) holds for every fy € S,. The distributions 
(u * $)* and $i are therefore equal. This proves (c). 

In the computation that precedes (6), the two end terms are now 
seen to be equal for any w € F,,.. Hence 


(7) (u * PX) = up * Y)”), 
which is the same as 
(8) ((u + p) * WO) = (u * (p * WO). 


If we replace w by t,.W in (8), we obtain (d). 
Finally, (a « d)* = di = (pu)”, by (c) above and (6) of Section 
7.16; this gives (e), since (@u)” = ((pu)*)*. //// 


Paley-Wiener Theorems 


One of the classical theorems of Paley and Wiener characterizes the entire 
functions of exponential type (of one complex variable), whose restriction to 
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the real axis is in I?, as being exactly the Fourier transforms of [?-functions 
with compact support; see, for instance, Theorem 19.3 of [23]. We shall 
give two analogues of this (in several variables), one for C®~-functions with 
compact support and one for distributions with compact support. 


7.20 Definitions If Q is an open set in €", and if f is a continuous 
complex function in Q, then f is said to be holomorphic in Q if it is holo- 
morphic in each variable separately. This means that if (a,, ..., a,) € Q and 
if 


gf{A) =f (ay, ..-, Aj_-4, A, + A, i441, --+5 Ay), 


each of the functions g,, ..., g, 18 to be holomorphic in some neighborhood 
of 0 in ¢. A function that is holomorphic in all of ¢” is said to be entire. 

Points of ¢” will be denoted by z = (z,, ..., z,), where z, € ¢. If z, = 
X, + Vy, X =(X1,---5 Xn)» V=(V>---> Vy), then we write z=x-+ iy. The 
vectors 


x=Rez and y=Imz 


are the real and imaginary parts of z, respectively; R” will be thought of as 
the set of all z € ¢" with Im z = 0. The notations 
2) Sz errr 2,14) 
[Im 2] = (yp +++ + Ya)? 


a 


— 7% 1... oO 
= 25 a 


Z°€=2,0, +°°'+2Z,1, 
e.(t) = exp (iz - t) 


will be used for any multi-index a and any t € R". 


7.21 Lemma _ If f is an entire function in €" that vanishes on R", then 


f=0. 


PROOF. We consider the case n = 1 as known. Let P, be the following 
property of f: If z e ¢" has at least k real coordinates, then f(z) = 0. 
P,, is given; Py is to be proved. Assume 1 < i <n and P; 1s true. Take 
a,,..., a; real. The function g; considered in Section 7.20 is then 0 on 
the real axis, hence is 0 for all A € ¢. It follows that P;_, istrue. —_//// 


In the following two theorems, 


rB = {x € R": |x| <r}. 
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7.22 Theorem 
(a) If gd € AR") has its support in rB, and if 


(1) f(z) = | pe" dm(t)  (z € €"), 


then f is entire, and there are constants yy < 0 such that 
(2) [fl syd + lz] Net (ze ¢", N =0, 1, 2,...). 


(b) Conversely, if an entire function f satisfies the conditions (2), then there 
exists @ € D(R"), with support in rB, such that (1) holds. 


PROOF. (a) If t € rB then 
fete) _ et < ely iiel < ef lim 2| 


The integrand in (1) is therefore in I/(R"), for every z € £", and f is 
well defined on €”. The continuity of f is trivial, and an application of 
Morera’s theorem, to each variable separately, shows that f is entire. 
Integrations by part give 


z°f(z) = [ (D, p)(t)e*'* dm,(t). 
Hence 


(3) IZ"*\1F(@1 < ID. ole? 


and (2) follows from the inequalities (3). 
(b) Suppose fis an entire function that satisfies (2), and define 


(4) P(t) = | f(xje""* dm,(x) (te R”). 


Note first that (1 + |x|)%f(x) is in L'(R") for every N, by (2). Hence 
go € C%(R"), by the argument that proved (c) of Theorem 7.4. 
Next, we claim that the integral 


(5) | f(€ + in, 22, ..., Z,) exp {i[t,(€ + in) +22. +°-' +t,2,]} dé 


is independent of y, for arbitrary real t,,..., t, and complex z,,..., Z,. 
To see this, let I be a rectangular path in the (& + in)-plane, with one 
edge on the real axis, one on the line 7 = y,, whose vertical edges 
move off to infinity. By Cauchy’s theorem, the integral of the inte- 
grand (5) over I is 0. By (2), the contribution of the vertical edges to 
this integral tend to 0. It follows that (5) is the same for 7 = 0 as for 


n = n,. This establishes our claim. 
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The same can be done for the other coordinates. Hence we con- 
clude from (4) that 


(6) P(t) = | f(x + iyje” =*™ dm, (x) 


for every y € R". 
Given te R", t#0, choose y= At/|t|, where 2>0. Then 
t-y=Altl,|yl =A, 


| f(x + iyje® &F™ | < y(t + [x] Me 


and therefore 


(7) | P(t)| < ywer¥ | (1 + |x|)" * dm,(~), 


where N is chosen so large that the last integral is finite. Now let 
A— o. If|t| > 7, (7) shows that ¢(t) = 0. Thus ¢ has its support in rB. 

Now (1) follows, for real z, from (4) and the inversion theorem. 
Since both sides of (1) are entire functions, they coincide on ¢", by 
Lemma 7.21. This completes the proof. //// 


The following remarks will motivate the next theorem. 


Let u be a distribution in R", with compact support. Then @ is defined, 
as a tempered distribution, by a() = u(d). However, the definition f(x) = 
| fe_,.dm,, made for fe L'(R"), suggests that @ ought to be a function, 
namely, 


u(x)=ule_,) (x € R’), 


because e_, € C~(R") and u(d) makes sense for every ¢ € C~(R"), as shown 
by (d) of Theorem 6.24. Moreover, e_, € C~(R") for every z € ¢", and u(e_.) 
therefore looks like an entire function, whose restriction to R" is i. 

That all this is correct is part of the content of the next theorem, 
which also characterizes the resulting entire functions by certain growth 
conditions. 


7.23 Theorem 


(a) Ifue B(R") has its support in rB, if u has order N, and if 
(1) f(zj=ule_.) (ze €"), 


then f is entire, the restriction of f to R" is the Fourier transform of u, and 
there is a constant y < 0 such that 


(2) If(2)| <p + |z)Metmt (ze G"). 
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(b) Conversely, if f is an entire function in €" which satisfies (2) for some N 
and some y, then there exists u € 9'(R"), with support in rB, such that (1) 
holds. 


Note: The notation & will sometimes be used to denote the extension 
to ¢" given by (1). Thus 
u(z) = u(e_,) 


for ze ¢". This extension is sometimes called the Fourier-Laplace trans- 
form of u. 


PROOF. (a) Suppose u € Y'(R") has its support in rB. Pick w € DR") 
so that y=1 on (r+ 1)B. Then u = Wu, and (e) of Theorem 7.19 
shows that 


(3) i = (Wu)* = i « p. 
Thus a € C®(R"). Pick ¢ € ¥, so that @ = W. Then 
(fi * W(x) = (@ * bx) = a(t, d) = u(t, 9)") 
= u(e_, P) = u(we_,) = u(e_,), 
so that (3) gives 
(4) u(x) = u(e_,) (x € R"). 


Our next aim is to show that the function f defined by (1) is 
entire. Choose a € ¢", b € ¢", and put 


(5) gl) =f(a + Ab) = ule_,-) UE). 


The continuity of f poses no problem: If w—z in ¢", then e_, > e_, 
in C~(R"), and u is continuous on C%(R"). To prove that f is entire it is 
therefore enough to show that each of the functions g defined by (5) is 
entire. 

Let I’ be a rectangular path in ¢. Since 4 e_,_,, is contin- 
uous, from ¢ to C®(R"), the C~(R")-valued integral 


(6) F= [eau dd 
Tr 


is well defined. Evaluation at any t € R" is a continuous linear func- 
tional on C%(R”). It therefore commutes with the integral sign. Hence 


F(t) = [ea w( di = [eterna dA = 0. 
r r 
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Thus F = 0, and (6) gives 


0=u(F) = | Wea di = | 90 dd. 


By Morera’s theorem, g is entire. 

The proof of part (a) will be completed by proving (2). Choose 
an auxiliary function h on the real line, infinitely differentiable, such 
that h(s) = 1 when s < 1 and h(s) = 0 when s > 2, and associate with 
each z € ¢" (z # 0) the function 


(7) o{t)=e"*'A(ltl|zZ|—riz|) (je R"). 


Then $,¢€Q(R"). Since the support of ui is in rB_ and 
A(|t\|z|—r|z|)=1 if |t}<|z|~’ +r, comparison of (1) and (7) 
shows that 


(8) F(z) = u(,). 


Since u has order N, there is a yo < 0 such that | u(d)| < yolldlly 
for all @ € A(R"), where ||¢||, is as in (1) of Section 6.2; see (d) of 
Theorem 6.24. Hence (8) gives 


(9) IF(Z)1 S voll Pzlly- 
On the support of ¢,,|t| <r + 2/|z|, so that 
(10) lee) —git< e2 trlim 2] 


If we now apply the Leibniz formula to the product (7) and use 
(10), (9) implies (2). 

This completes the proof of part (a). 

(b) Since f now satisfies (2), we have 


(11) IFO) Svh + 1x1)" (we R’). 


The restriction of f to R” is therefore in “, and is the Fourier trans- 
form of some tempered distribution u. 

Pick a function h € ZR"), with support in B, such that [ h = 1, 
define h,(t) = e "h(t/e), for e > 0, and put 


(12) fi) =f(2h(z) (ze ©"), 


where h, now denotes the entire function whose restriction to R" is the 
Fourier transform of h,. Statement (a) of Theorem 7.22, applied to h,, 
leads to the conclusion that f, satisfies (2) of Theorem 7.22 with r + ¢ 
in place of r. Therefore (b) of Theorem 7.22 implies that f, = @, for 
some @, € G(R") whose support lies in (r + ¢&)B. 

Consider some we ¥, such that the support of does not 
intersect rB. Then Wd, =0 for all sufficiently small ¢> 0. Since 
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fw € L(R") and h,(x) = h(ex) > 1 boundedly on R", we conclude that 


u(y) = a(p) = | fv dm, = lim | nv dm, 


e>0O 


=lim | ¢,w dm, = lim | vs, dm, = 0. 
e 70 e>0 
Hence u has its support in rB. 
Now we see that z > u(e_,) is an entire function, and since (1) 
holds for z € R" (by the choice of u), Lemma 7.21 completes the proof 


of (b). Hf! 


Sobolev’s Lemma 


If Q is a proper open subset of R", no Fourier transform has been defined 
for functions whose domain is Q or for distributions in Q. Nevertheless, 
Fourier transform techniques can sometimes be used to attack local prob- 
lems. Theorem 7.25, known as Sobolev’s lemma, is an example of this. 


7.24 Definitions A complex measurable function f, defined in an open 
set Qc R", is said to be locally LV in Q if {x | f |? dm, < co for every 
compact K <Q. 

Similarly, a distribution u € J’(Q) is locally 7 if there is a function g, 
locally L’ in Q, such that u(¢) = [g gd dm, for every @ € BQ). To say that 
a function f has a distribution derivative D*f which is locally ? refers to the 
distribution D*f and means, explicitly, that there is a function g, locally LP’, 
such that 


| 90 dm, = (— 1)" | m6 dm, 


for every ¢ € HQ). A priori, this says nothing about the existence of D%f in 
the classical sense, in terms of limits of quotients. 

On the other hand, the class CQ) consists, for each nonnegative 
integer p, of those complex functions f in Q whose derivatives D*f exist in 
the classical sense, for each multi-index « with |a| < p, and are continuous 
functions in Q. 

We shall write D* for the differential operator (0/0x;,)*. 


7.25 Theorem Suppose n, p,r are integers, n > 0, p > 0, and 


n 
1 —, 
(1) sad eae 
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Suppose f is a function in an open set Q < R" whose distribution derivatives 
Dif are locally ? inQ, forl<i<n,0<k<r. 

Then there is a function fy € CQ) such that fo(x) = f(x) for almost 
every x €Q. 


Note that the hypothesis involves no mixed derivatives, i.e., no terms 
like D,D, f. The conclusion is that f can be “corrected” so as to be in 
CQ), by redefining it on a set of measure 0. 

Note also, as a corollary, that if all distribution derivatives of f are 
locally 2 in Q, then fo € C%(Q). 


PROOF. By hypothesis, there are functions g,,, locally in Q, that 
satisfy 


(2) | oud dm, = (—1)" | pie dm, [ee AQ], 


forl <i<n,0<k<r. 
Let w be an open set whose closure K is a compact subset of Q. 
Choose w € BQ) so that wy = 1 on K, and define F on R" by 


_ JW) F() if x € Q, 
Fo) =4° if x ¢ Q. 


Then F € (2 cn LYR"). 
In Q, the Leibniz formula gives 


G)  DiF=y ("hor wyoin = ("or Ww. 
In the complement Q, of the support of w, Di F = 0. These two dis- 
tributions coincide in QA Q,. Hence Dj F, originally defined as a 
distribution in R", is actually in [7(R"), for 1 < i <n, because the func- 
tions (Di ‘w)g,, are in I7(Q). [Having compact support, Dj F is there- 
fore also in L’(R").] 

The Plancherel theorem, applied to F and to D{F, ..., D’F, 
shows now that 


(4) | | F |? dm, < oo 
Rn 
and 
(5) | yi" |F(y) |? dm(y)< 00 (1 <i<n). 
Rn 
Since 


(6) (1+ |y|)?" < (Qn 4+ 2y(1 + yp? + °° + 77), 
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where | y| = (yt + °° + yn)”, (4) and (5) imply 


(7) | (1 + | y|)?"| F(y) |? dm,(y) < 00. 


If J denotes the integral (7), and if o, is the (n — 1)-dimensional 
volume of the unit sphere in R", the Schwarz inequality gives 


i (1 + | yl)P| FO)! dm) <s| (1 + |y|)??~* dm,(y) 


= Jo, | (1 + 2)7?> "rt"! dt < co, 
0 
since 2p — 2r +n— 1 < —1. We have thus proved that 


(8) | (1 + | y1)?| F(y)| dm,(y) < 00. 


Define 


(9) F(x) = | F(y)e*'” dm,(y) (x € R"). 


By (c) of the inversion theorem 7.7, F,, = F a.e. on R". Moreover, (8) 
implies that y*F(y) is in L’ whenever | «| < p. Iteration of the proof of 
(c) of Theorem 7.4 leads therefore to the conclusion 


(10) F,, € CXR’), 


Our given function f coincides with F in w. Hence f= F.,, ae. 
in w. 

If w’ is another set like w, the preceding proof gives a function 
F.,. € CR"), which coincides with f a.e. in w’. Hence F,,. = F,, in 
w' ~ @. The desired function f, can therefore be defined in 2 by 
setting f,(x) = F(x) if x € @. //// 


Exercises 


1. Suppose A is an invertible linear operator on R", fe L'(R"), and g(x) = f(Ax). 


Express g in terms of f This generalizes (d) of Theorem 7.2. 


2. Is the topology of Y, induced by some invariant metric which turns the Fourier 


transform into an isometry of , onto F,? 


3. Suppose f(x) = e*, g(x) = e* cos (e*), on the real line. Show that g is a tempered 


distribution but that fis not. 


4. By Exercise 3 there exist distributions in R" which are not tempered. Such dis- 


tributions are continuous linear functionals on ZR") which have no continuous 


11. 


12. 


13. 


14. 


15. 
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linear extension to #,. Explain why this does not contradict the Hahn-Banach 
theorem. 


. (a) Construct a sequence in YR") which converges to 0 in the topology of , 


but not in that of DR”). 
(b) Construct a sequence of polynomials which converges in the topology of 
3'(R'*) but not in that of S',. 


. Prove that the operations listed in Theorem 7.13 are continuous mappings of 7, 


into #,. 


. Ifue S,, prove that 


(t,u)* =e_,t and (eu) =t,u 


x 


for every x € R". 


. Suppose f € L(R"), f 0, A is a complex number, and f = Af, What can you say 


about A? 


. Prove (a) of Theorem 7.8 directly (without using Fourier transforms). 
10. 


The Fourier transform of a complex Borel measure yw on R" is customarily 
defined to be the function fi given by 


f(x) = | e*''du(t) (xe R"). 
Rn 


Of course, p is also a tempered distribution, and as such its Fourier transform 
was defined in Section 7.14. Show that these two definitions are consistent. 
Prove that each fi is bounded and uniformly continuous. 

Suppose A: #, > C(R") is continuous, linear, and t, A = At, for every x € R". 
Does it follow that there exists u €e Y, such that 


Ad =u* 


for every de &,,? 

If {h,} is an approximate identity, as in Definition 6.31, and ue /,, does it 

follow that u * h; > uasj > oo, in the weak*-topology of %,? 

Suppose X and Y are complete metric spaces, A is dense in X, f: A> Y is 

uniformly continuous. 

(a) Prove that f has a unique continuous extension F: X > Y. 

(b) If f is an isometry, prove that the same is true of F, and prove that F(X) 1s 
closed in Y. 
(This was used in the proof of the Plancherel theorem; see also Exercise 19, 
Chapter 1.) 

Suppose F is an entire function in ¢", and suppose that to each e > 0 there 

correspond an integer N(e) and a constant y(&) < oo such that 


| F(z)| < eX + [z)%erlim2! (x € €"). 


Prove that F is a polynomial. 
Suppose fis an entire function in ¢”, N is a positive integer, r > 0, and 


IfDl< 4+ |z|%eM™?! — for all ze €", 
lf(x)| <1 for all x € R". 
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16. 


17. 


Prove that then 
lf(z)|<e"™?! for allze ¢”. 
Suggestion: Fix z = x + iy € ¢"; define 
g(A) = (1 — isd) N~Pelrbl* F(x + Ay) 


for Ae ¢, s>0, and apply the maximum modulus theorem to a large semi- 
circular region in the upper half-plane to deduce that | g,(i)| < 1. Let s > 0. 
In (b) of Theorem 7.23 it is not asserted that u has order N. The following 
example shows that this is not always true. 

Let u be the Borel probability measure on R* which is concentrated on 
the unit sphere S? and which is invariant under all rotations of S*. Compute (by 
using spherical coordinates) that 


f(x) = (x € R°). 


Put u = D,p. Then 
| u(x) | = |x, H(x)| < 1 (x € R°). 
Deduce from Exercise 15 that 
|u(e_,)|<yel™7! = (ze €°) 


although u is not a distribution of order 0. (Its order is 1.) Find an explicit 

formula for the entire function u(e_,), z € ¢°. 

Suppose u is a distribution in R", with compact support K, whose Fourier trans- 

form i is a bounded function on R". 

(a) Assume n= 1 or n= 2, and prove that wu = 0 for every Ww € C™(R") that 
vanishes on K. 

(b) Assume n = 2, and assume that there is a real polynomial P, in two vari- 
ables, that vanishes on K. Prove that Pu = 0 and that iu therefore satisfies 
the partial differential equation P(— D)i = 0. For example, when K is the 
unit circle, then 


i+ At = 0, 


where A = 07/dx? + 07/dx2 is the Laplacian. 

(c) Show, with the aid of Exercise 16 and the polynomial 1 — xj — x3 — x3, 
that (b), hence also (a), becomes false with n = 3 in place of n = 2. 

(d) Assume n = 1, fe L(R), f= 0 on K, and f satisfies a Lipschitz condition of 
order 4, that is, | f(t) — f(s)| < C|t — s|!*. Prove that then 


{" SF (x)ti(x) dx = 0. 


Suggestion: For any n, let H, be the set of all points outside K whose 
distance from K is less than ¢ > 0. Let {h,} be an approximate identity, as in the 
proof of (b) of Theorem 7.23, use the Plancherel theorem to obtain 


ue Aes < aloe Why lla, 


18. 


19. 


20. 


21. 


22. 
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and show that therefore 


1/2 
|u(P)| < al], ]4, ||, lim inf \e | |p|? im,| 
é70 H, 

for any ¢ € @(R") that vanishes on K. 

This yields (a). A slight modification yields (d); (b) follows from (a). 
Was it necessary to introduce the function w into the proof of Theorem 7.25? 
Could the proof have been simplified by setting F(x) = f(x) on K, F(x) = 0 off 
K? 
Show that the hypotheses of Theorem 7.25 imply that D*f is locally I? for every 
multi-index « with |a| <r. 
Let f € °(R*) be the continuous function whose Fourier transform is 


f(y) =(1 + lvl) A*flog (2 + Ly}* (ye R?). 


Since | y|3f(y) is in 2(R?), Theorem 7.25 implies that f ¢ C"(R?). Show that the 
stronger conclusion f € CR?) is false, by proving that 


f(h, 0) + f(=h, 0) — FO, 0) 


2 —oash—0. 


This shows that > cannot be replaced by > in (1) of Theorem 7.25. 
Suppose u is a distribution in R" whose first derivatives D,u, ..., D,u are func- 
tions in 7(R"). Prove that u is also a function and that u is locally 2’. (Show that 
“locally” cannot be omitted in the conclusion.) Hint: u is in fact the sum of an 
P?-function and an entire function. 

When n = 1, show that u is actually a continuous function. Show that this 
stronger conclusion is false when n = 2. For example, the gradient of the func- 
tion 


; 1 
u(re’’) = log log (2 + -) 
r 


is in [?(R). See Exercise 11, Chapter 8, for the same result under weaker 
hypotheses. 
Periodic distributions, or distributions on a torus JT", have Fourier series whose 
theory is somewhat simpler than that of Fourier transforms. This is mainly due 
to the compactness of T”": Every distribution on T”" has compact support. In 
particular, tempered distributions are nothing special. 

Prove the various assertions made in the following basic outline: 


TP ee (CP sie"). x real) 


Functions ¢ on T" can be identified with functions ¢ on R" that are 2a-periodic 
in each variable, by setting 


P(X4, --+5 X,) = P(e, ..., ef). 


Z" is the set (or additive group) of n-tuples k = (k,, ..., k,) of integers k,. For 
k € 2", the function e, is defined on T” by 


e,(e™!, ..., e") = e*' * = exp {i(k,x, +--+ k,x,)}. 
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c, is the Haar measure of T". If ¢ € L'(a,), the Fourier coefficients of @ are 


P(k) = | e-,pdo, (ke Z"). 
= 


YT") is the space of all functions ¢ on T” such that ¢ e C%(R’). If ¢ € AT") 
then 


1/2 
Y (+k: pri swr < oO 
keZ" 

for N = 0, 1, 2, .... These norms define a Fréchet space topology on A(T"), 
which coincides with the one given by the norms 


max sup |(D*¢\x)| (N =0, 1, 2,...). 
ja|<N xeER" 
Q(T") is the space of all continuous linear functionals on A(T"); its members 
are the distributions on T". The Fourier coefficients of any u € Y'(T") are defined 
by 
i(k) = u(e_,) (k € 2"). 


To each u € YT") correspond an N and a C such that 


| a(k)| < CI + Ik)" (k € 2"). 


Conversely, if g is a complex function on Z" that satisfies | g(k)| < C(1 + |k|)% 
for some C and N, then g = i for some u € YT"). 

There is thus a linear one-to-one correspondence between distributions 
on T", on one hand, and functions of polynomial growth on Z”, on the other. 

If E, cE, cE, c::: are finite sets whose union is Z", and if ue D(T"), 
the “ partial sums ” 

2. u(k)e, 
ke Ej 

converge to u as j > 00, in the weak*-topology of ZT”). 

The convolution u * v of ue YT") and v € YT") is most easily defined 
as having Fourier coefficients i(k)i(k). The analogues of Theorems 6.30 and 6.37 
are true; the proofs are much simpler. 
Modify the proof of Theorem 7.25 so that Fourier series are used in place of 
Fourier transforms, by replacing F by a suitable periodic function. 
Put c = (2/z)"*. For j = 1, 2, 3, ..., define g, on the real line by 


c/t if i/j<|t| <j 
g(t) = 
0 otherwise. 


Prove that {g;} is a uniformly bounded sequence of functions which converges 
pointwise, as j > oo. If fe L’°(R'), it follows that f* g, converges, in the L’- 
metric, to a function Hf € L’. This is the Hilbert transform of f; formally, 


1 oO 
(Hf \x) = - | on dt. 


— @ 
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(The integral exists, in the principal value sense, for almost every x, but this is 


not so easy to prove; if f satisfies a Lipschitz condition of order 1, for instance, 
the proof is trivial.) Prove that 


IAF ll. = If lle and H(Af) = —f, 


for every f € 7(R'). Thus H is an L’-isometry, of period 4. 
Is it true that Hfe Y, iffe S,? 


CHAPTER 


APPLICATIONS TO 
DIFFERENTIAL 
EQUATIONS 


Fundamental Solutions 


8.1 Introduction We shall be concerned with linear partial differential 
equations with constant coefficients. These are equations of the form 


(1) P(D)u = v 


where P is a nonconstant polynomial in n variables (with complex 
coefficients), P(D) is the corresponding differential operator (see Section 7.1), 
v is a given function or distribution, and the function (or distribution) u is a 
solution of (1). 

A distribution E € 9’(R") 1s said to be a fundamental solution of the 
operator P(D) if it satisfies (1) with.v = 6, the Dirac measure: 


(2) P(D)E = 6. 


The basic result (Theorem 8.5, due to Malgrange and Ehrenpreis) that will 
be proved here is that such fundamental solutions always exist. 

Suppose we have an E that satisfies (2), suppose v has compact 
support, and put 


(3) u=E * v. 
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Then wu 1s a solution of (1), because 
(4) P(D\E * v) = (P(D)E) * v= 6 * v=, 


by Theorems 6.35 and 6.37. 

The existence of a fundamental solution thus leads to a general exis- 
tence theorem for the equation (1); note also that every solution of (1) 
differs from E * v by a solution of the homogeneous equation P(D)u = 0. 
Moreover, (3) gives some additional information about u. For instance, if 
v € Y(R"), then u € C%(R"). 

It may of course happen that the convolution E * v exists for certain v 
whose support is not compact. This raises the problem of finding E so that 
its behavior at infinity 1s well under control. The best possible result would 
of course be to find an E with compact support. But this can never be done. 
If it could, E would be an entire function, and (2) would imply PE = 1. But 
the product of an entire function and a polynomial cannot be 1 unless both 
are constant. 

However, the equation PE = 1 can sometimes be used to find E, 
namely, when 1/P is a tempered distribution; in this case, the Fourier trans- 
form of 1/P furnishes a fundamental solution which is a tempered distribu- 
tion. For examples of this, see Exercises 5 to 9. 

Another related question concerns the existence of solutions of (1) 
with compact support if the support of v is compact. The answer (given in 
Theorem 8.4) shows very clearly that it is not enough to study P on R” in 
problems of this sort but that the behavior of P in the complex space ¢” is 
highly significant. 


8.2 Notations T” is the torus that consists of all points 


(1) w = (e, ..., e') 


in ¢”, where 0,, ..., 6, are real; o, is the Haar measure of T”, that is, 
Lebesgue measure divided by (27)". 
A polynomial in ¢", of degree N, is a function 


(2) P(z)= )) c(xz* (ze €"), 


lal<N 


where a ranges over multi-indices and c(«) € €. If (2) holds and if c(«) # 0 
for at least one « with | «| = N, P is said to have exact degree N. 


8.3. Lemma _ [If P is a polynomial in ¢", of exact degree N, then there is a 
constant A < oo, depending only on P, such that 


(1) | f(z)| < Ar~™ | [(fP\z + rw)| do,(w) 


for every entire function f in {", for every z € {", and for every r > 0. 
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PROOF. Assume first that F is an entire function of one complex vari- 
able and that 


(2) Q)=cT] +a) GeQ) 


Put Q,(A) =c I] (1 + a,A). Then cF(0) = (FQ,)(0). Since |Q,| =|Q| 
on the unit circle, it follows that 


1 . 
(3) | cF(0) | <x | |(FQ\e"*)| dé. 
The given polynomial P can be written in the form P= 


Py) + P,; +::: + Py, where each P; is a homogeneous polynomial of 
degree j. Define A by 


1 
(4 [= [i Py| doy. 


This integral is positive, since P has exact degree N. [See part (b) of 
Exercise 1.] If z € ¢" and w e€ T”, define 


(5) F(A) = f(z + rdw), Q(A) = P(z + rdw) (Ae ¢). 
The leading coefficient of Q is r’P,(w). Hence (3) implies 


a 
(6) r™ | Px(w) || f(2)| S on i |(fP)(z + re’w)| dé. 


If we integrate (6) with respect to o,,, we get 


 — If@l< ar = 7 do | |(fP\e + relw)| do,(). 


The measure o, is invariant under the change of variables 
w > ew. The inner integral in (7) is therefore independent of 6. This 


gives (1). //// 


Theorem Suppose P is a polynomial in n variables, v € 2'(R"), and v 


has compact support. Then the equation 


(1) 


P(D)u = v 


has a solution with compact support if and only if there is an entire function g 
in €" such that 


(2) 


Pg =v. 


When this condition is satisfied, (1) has a unique solution u with compact 


support ; the support of this u lies in the convex hull of the support of v. 
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PROOF. If (1) has a solution u with compact support, (a) of Theorem 
7.23 shows that (2) holds with g = u. 

Conversely, suppose (2) holds for some entire g. Choose r > 0 so 
that v has its support in rB = {x € R": |x| <r}. By Lemma 8.3, (2) 
implies 


(3) | g(z)| < A | |o(z + w)| do,w) (ze €"). 
Tn 
By (a) of Theorem 7.23, there exist N and y such that 


(4) |i(z + w)| < p11 + [z+ wl)% exp {r|Im (z + w)|}. 


There are constants c, and c, that satisfy 


(5) 1+|z+w|<c,0 4+ |z]) 
and 
(6) [Im (z + w)| <c, + |Im z| 


for all z € ¢” and all w € T”. It follows from these inequalities that 
(7) |g(z)| < BL +|z|)" exp {r|Imz|} (ze ¢"), 


where B is another constant (depending on y, A, N, c,, c,, and r). By 
(7) and (b) of Theorem 7.23, g =u for some distribution u with 
support in rB. Hence (2) becomes Pi = 0, which is equivalent to (1). 

The uniqueness of u is obvious, since there is at most one entire 
function ui that satisfies Pu = 0. 

The preceding argument showed that the support S, of u lies in 
every closed ball centered at the origin that contains the support S, of 
v. Since (1) implies 


(8) P(D\t,u)=Tt,0 (x € R”), 


the same statement is true of x + S, and x + S,. Consequently, S, lies 
in the intersection of all closed balls (centered anywhere in R") that 
contain S,. Since this intersection is the convex hull of S,, the proof is 


complete. /T// 


8.5 Theorem If P is a polynomial in €, of exact degree N, and if r > 0, 
then the differential operator P(D) has a fundamental solution E that satisfies 


(1) | E()| < Ar~™ | do,(w) | | p(t + rw)| dm,(0) 


for every W € D(R"). 
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Here A is the constant that appears in Lemma 8.3. The main point of 
the theorem is the existence of a fundamental solution, rather than the esti- 
mate (1) which arises from the proof. 


PROOF. Fix r > 0, and define 


(2) Iv = | do,(w) | | W(t + rw)| dm,(t) 
Tn Rn 
In preparation for the main part of the proof, let us first show that 
(3) lim ||y,|| = 0 ify; +0 in DR"). 
jroo 


Note that J(t + w) = (e_,, w)*(t) if t € R" and we ¢”. Hence 


(4) ly -| do.) | |(C--wW)* | dm, 


If y,; +0 in A(R"), all y; have their supports in some compact set K. 
The functions e,,, (w € T”) are uniformly bounded on K. It follows 
from the Leibniz formula that 


(5) || D*(e _ nw W dll oo Ss C(K, at) max || DPW ill o- 


The right side of (5) tends to 0, for every a. Hence, given ¢ > 0, there 
exists j, such that 


(6) I — Ay"e_w¥idle<€ (W>jo.weT"), 


where A= D?+---+ D? is the Laplacian. By the Plancherel 
theorem, (6) is the same as 


(7) | [1 + [t))"h ft + rw)|? dm,(t) < &, 


from which it follows, by the Schwarz inequality and (2), that 
|W | < Ce for all j > jo, where 


(8) C= | (1 + |t|?)~ 2" dm,(t) < oo. 
Rn 
This proves (3). 
Suppose now that @ € BR") and that 
(9) y = P(D)¢. 


Then p = Pd, ¢ and Ww are entire, hence determines @. In particular, 
@(0) is a linear functional of w, defined on the range of P(D). The crux 
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of the proof consists in showing that this functional is continuous, i.e., 
that there is a distribution u € @'(R") that satisfies 


(10) u(P(D)?) = (0) (pe A(R"), 
because then the distribution E = u satisfies 
(P(D)E\(d) = E(P(— D)d) = u((P(— D)¢)") 
= u(P(D)b) = $(0) = $(0) = 6(9), 


so that P(D)E = 0, as desired. 
Lemma 8.3, applied to Pd = y, yields 


(11) | d(t)| < ar" | \W(t + rw)| do,(w) — (t € R”). 


By the inversion theorem, (0) = ) zn @ dm,. Thus (11), (2), and (9) give 
(12) |$(0)| < Ar™™||P(D)P|| — (@ € AR"). 

Let Y be the subspace of Y(R") that consists of the functions 
P(D)¢, @ € ZR"). By (12), the Hahn-Banach theorem 3.3 shows that 


the linear functional that is defined on Y by P(D)¢ — (0) extends to a 
linear functional u on Z(R") that satisfies (10) as well as 


(13) lu(y)|< Arp = € A(R’). 
By (3), u € 9(R"). This completes the proof. //// 


Elliptic Equations 


8.6 Introduction If u is a twice continuously differentiable function in 
some open set Q < R? that satisfies the Laplace equation 
O7u— 0*u 


Ox 

then it is very well known that u is actually in C%(Q), simply because every 
real harmonic function in Q is (locally) the real part of a holomorphic func- 
tion. Any theorem of this type—one which asserts that every solution of a 
certain differential equation has stronger smoothness properties than is a 
priori evident—is called a regularity theorem. 

We shall give a proof of a rather general regularity theorem for elliptic 
partial differential equations. The term “elliptic ” will be defined presently. 
It may be of interest to see, first of all, that the equation 


Cru 
Ox Oy 


(1) = 0, 


(2) 


behaves quite differently from (1), since it is satisfied by every function u of 
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the form u(x, y) = f(y), where fis any differentiable function. In fact, if (2) is 
interpreted to mean 


0 (Ou 
° (ax) 2° 


then fcan be a perfectly arbitrary function. 


8.7 Definitions Suppose Q is open in R", N is a positive integer, 
f, € C°(Q) for every multi-index « with |«| < N, and at least one f, with 
|a| = N is not identically 0. These data determine a linear differential oper- 
ator 


(1) L= ) fad, 

lal <N 
which acts on distributions u € Y’(Q) by 
(2) Lu= > f,D,u. 

Jal <N 

The order of L is N. The operator 
(3) » SD. 
ja|/=N 
is the principal part of L. The characteristic polynomial of L is 
(4) Ax y= Dd fdy® (x €Q, ye R’). 
ja/=N 


This is a homogeneous polynomial of degree N in the variables y = 
(V1,---> Y,), With coefficients in C°(Q). 

The operator L is said to be elliptic if p(x, y) # 0 for every x € Q and 
for every y € R", except, of course, for y = 0. Note that ellipticity is defined 
in terms of the principal part of L; the lower-order terms that appear in (1) 
play no role. 

For example, the characteristic polynomial of the Laplacian 


0G? 0? 
5 Rab ences oe a She 
©) Ox? 7_ a 0x? 
is p(x, y) = —(yj7 + -°:: + y?), so that A is elliptic. 


On the other a if L = 07/0x, Ox, then p(x, y) = —y,yz, and L is 
not elliptic. 

The main result that we are aiming at (Theorem 8.12) involves some 
special spaces of tempered distributions, which we now describe. 


8.8 Sobolev spaces Associate to each real number s a positive measure 
uu, on R" by setting 


(1) duy) = (1 + | y|*)° dm,(y). 


CHAPTER 8: APPLICATIONS TO DIFFERENTIAL EQUATIONS 217 


If fe P(u,), that is, if [| f|? du, < oo, then f is a tempered distribution 
[Example (c) of Section 7.12]; hence f is the Fourier transform of a tem- 
pered distribution u. The vector space of all u so obtained will be denoted 
by H*; equipped with the norm 


1/2 
(2) lull, = (| | |? iu.) 


H’ is clearly isometrically isomorphic to [7(u,). 

These spaces H* are called Sobolev spaces. The dimension n will be 
fixed throughout, and no reference to it will be made in the notation. 

By the Plancherel theorem, H° = LP’. 

It is obvious that H* c H' if t <s. The union X of all spaces H’* is 
therefore a vector space. A linear operator A: X — X 1s said to have order t 
if the restriction of A to each H* is a continuous mapping of H* into H*'; 
note that t need not be an integer and that every operator of order t also 
has order t’ if t’ > t. 

Here are the properties of the Sobolev spaces that will be needed. 


8.9 Theorem 


(a) Every distribution with compact support lies in some H*. 
(b) If —w <t < o, the mapping u - v given by 
iy) =(1 + [yay (ye R") 


is a linear isometry of H* onto H*' and is therefore an operator of order 
t whose inverse has order —t. 

(c) If be L*(R"), the mapping u—v given by t = bi is an operator of 
order (. 

(d) For every multi-index «, D, is an operator of order | «|. 

(e) Iffe F,, then u — fu is an operator of order 0. 


PROOF. If u € Y’(R") has compact support, (a) of Theorem 7.23 shows 
that 


(1) Jay); < CA +ly)X (eR, 


for some constants C and N. Hence ue H* if s< —N —n/2. This 
proves part (a); (b) and (c) are obvious. The relation 


|(D,u)(y)| = ly“ ay) | < + Ly Py"? | ay) | 
implies 
(2) [Dull s— ja Ss lull,, 
so that (d) holds. 
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The proof of (e) depends on the inequality 


(3) (1+ [x+y < 251 + |x? + [yy 
valid for x eR", ye R", —o <s<o. The case s=1 of (3) is 
obvious. From it the case s = —1 is obtained by replacing x by x — y 


and then y by —y. The general case of (3) is obtained from these two 
by raising everything to the power | s|. It follows from (3) that 


(4) | [h(x — y)|? du(x) < 2''(1 + | y|?)'" | |h|? dy, 


for every measurable function h on R". 

Now suppose ue H’, fe Y,, t>|s|+n/2. Since fe F,, 
If ll, < co. Put y = yw), ,(R"). Then y < 00. Define F =|i| *|f|. By 
Theorem 7.19, 


(5) \(fu)"| =|a@*f| <|al«|fl=F. 
By the Schwarz inequality, 


(6) |FQx)|? < | IF? duly) | | a(x — y)|? du_,(y) 
Rn Rn 
for every x € R”. Integrate (6) over R", with respect to p,. By (4), the 
result is 
(7) | |F |? du, < 2! 'y|| fll? lulls. 
Rn 
It follows from (5) and (7) that 
(8) I fu |, < (2)? UF llellual- 
This proves (e). //// 


8.10 Definition Let Q be open in R”. A distribution u € Y’(Q) is said to 
be locally H* if there corresponds to each point x € Q a distribution v € H* 
such that u = v in some neighborhood a of x. (See Section 6.19.) 


8.11 Theorem If ue 9(Q) and —w <s < «, the following two state- 
ments are equivalent: 


(a) wis locally H°. 
(b) wue H* for every w € DQ). 

Moreover, if s is a nonnegative integer, (a) and (b) are equivalent to 
(c) D,uis locally L? for every « with |a|< s. 
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Statement (b) may need some clarification, since u acts only on test 
functions whose supports lie in 2. However, wu is the functional that 
assigns to each @ € Y(R") the number 


(yup) = uly). 
Note that yd € BQ), so that u(w¢) is defined. 


PROOF. Assume u is locally H*. Let K be the support of some 
Ww € DQ). Since K is compact, there are finitely many open sets w,; ¢ 

Q, whose union covers K, and in which u coincides with some vu; € H*. 
There exist functions w;¢ Dw, such that ) ¥;=1 on K. If 
bd € Y(R") it follows that 


up) = > uid) = > ob), 

since W,wd € Dw). Thus pu=)> wW;,v;. By (e) of Theorem 8.9, 
Ww, wv, € H* for each i. Thus wu € H°, and (a) implies (b). 

If (b) holds, if x € Q, and if w € DQ) is 1 in a neighborhood w of 
x, then u = Wu in w, and Wu € H* by assumption. Thus (b) implies (a). 

Assume again that (b) holds. If WW € GQ), then wu € H*, hence 
DAwu) € HS~'*!, by (d) of Theorem 8.9. If |a| < s, then 

Hs7~'¢! c H®° = P(R"). 


Thus D,(ywu) € L’(R"). Taking wy = 1 in some neighborhood of a point 
x € QO shows that D, u is locally ? in Q. Thus (b) implies (c). 

Finally, assume D,u is locally I? for every a with |a| <s. Fix 
Ww € Z(Q). The Leibniz formula shows that D,(wu) € P-(R") if |a| <s. 
Hence 


(1) | Ly |? [(u)*(y) |? dm,(y)< co (Ja| <3). 


If s is a nonnegative integer, (1) holds with the monomials yj, ..., ys 1n 
place of y*. It follows, as in the proof of Theorem 7.25, that 


(2) ix +l yl?) 1(Wu)*(y)? dm,(y) < 0c. 


Thus wu e€ H’*, (c) implies (b), and the proof is complete. //// 


8.12 Theorem Assume Q is an open set in R", and 


(a) L=)_ f,D, is a linear elliptic differential operator in Q, of order N > 1, 
with coefficients f, € C°(Q), 


(b) for each a with |a| = N, f, is a constant, 
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(c) uand vare distributions in Q that satisfy 
(1) Lu = 2, 
and v is locally H°. 


Then u is locally H8*%. 


Corollary. If L satisfies (a) and (b) and if v € C®(Q), then every solution 
u of (1) belongs to C®(Q). In particular, every solution of the homoge- 
neous equation Lu = 0 is in C°Q). 


For if ve C%(Q), then wu € OR") for every w € DQ); hence v is 
locally H* for every s, and the theorem implies that u is locally H* for every 
s; it follows from Theorems 8.11 and 7.25 that ue C°(Q). 

Assumption (b) can be dropped from the theorem, but its presence 
makes the proof considerably easier. 


PROOF. Fix a point x € Q, let By c Q be a closed ball with center at x, 
and let dy € AQ) be 1 on some open set containing B,. By (a) of 
Theorem 8.9, @,u eH’ for some t. Since H' becomes larger as t 
decreases, we may assume that t =s + N —k, where k is a positive 
integer. Choose closed balls 


Bo > By DD Be, 


each centered at x, and each properly contained in the preceding one. 
Choose @¢,,..., d, € BQ) so that ¢; = 1 on some open set containing 
B;, and ¢; = 0 off B;_,. Since @,u € H’, the following “bootstrap” 
proposition implies that 

gue H'*',...,6,ue H**. 


It therefore leads to the conclusion that u is locally H**™, because 
t+k=s+N and ¢, =1 on B,. //// 


Proposition. If, in addition to the hypotheses of Theorem 8.12, wu € H' 
for some t<s+N —1 and for some W € BQ) which is 1 on an open 
set containing the support of a function @ € BQ), then du € H'*?. 
PROOF. We begin by showing that 
(2) L(@u) € H'ON*?, 

Consider the distribution 


(3) A = L(gu) — @Lu = L(pu) — ov. 
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Since its support lies in the support of ¢, u can be replaced by wu in 
(3), without changing A: 


(4) A=Lippu) — oL(yu) = | d Se [D(owu) — oD (yu)]. 

If the Leibniz formula is applied to D,(¢@ - wu), one sees that the deriv- 
atives of order N of wu cancel in (4). Therefore A is a linear com- 
bination [with coefficients in Z(R")] of derivatives of wu, of orders at 
most N — 1. Since wu € H’, parts (d) and (e) of Theorem 8.9 imply that 
A € H'-N*!. By Theorem 8.11, dv € H’, and since t— N + 1 <s, we 
have ov € H'~%*?. Now (2) follows from (3). 

Since L is elliptic, its characteristic polynomial 


(5) p= > fy® (ye R’) 


lal =N 


has no zero in R", except at y = 0. Define functions 


(6) ay)=lyl*py), = ry) = (1 + ly Ia), 


for ye R", y £0, and define operators Q, R, S on the union of the 
Sobolev spaces by 


(7) (Qw)* = qv, (Rw)* = rw 
and 
(8) S= | 2. Wa D,. 


Since p is a homogeneous polynomial of degree N, q(Ay) = q(y) if 
A > 0, and since p vanishes only at the origin, the compactness of the 
unit sphere in R” implies that both gq and 1/q are bounded functions. 
It follows from (c) of Theorem 8.9 that both Q and Q™' are operators 
of order 0. 

Since both (1+ |y|?)~4/7(1 + |y|%) and its reciprocal are 
bounded functions on R", it follows from the preceding paragraph, 
combined with (b) and (c) of Theorem 8.9, that R is an operator of 
order N whose inverse R~' has order —N. 

Since Wf, € BR") it follows from (d) and (e) of Theorem 8.9 that 
S is an operator of order N — 1. 

Since p = r — q, and since p is assumed to have constant coeffi- 
cients f,, we have 


(9) ( 2 Iq Da ») = pw = (r — q)w = (Rw — Qw)" 


if w lies in some Sobolev space. Hence 


(10) (R — Q + Sou) = L(pu). 
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By (2), L(gu) € Ht N*?, 
Since wue H' and dw = @, (e) of Theorem 8.9 implies that 
gu = dwu € H'. Hence 


(11) (Q — S\(pu) ¢ H'*™?, 

because Q has order 0 and S has order N — 12> 0. It now follows 
from (10) that 

(12) R(gu) € HtO™*?, 


and since R~' has order — N, we finally conclude that gu € H'**. //// 


8.13 Example Suppose L is an elliptic differential operator in R", with 
constant coefficients, and E is a fundamental solution of L. In the com- 
plement of the origin, the equation LE = 6 reduces to LE = 0. Theorem 
8.12 implies therefore that, except at the origin, E is an infinitely differentia- 
ble function. The nature of the singularity of E at the origin depends, of 
course, on L. 


8.14 Example The origin in R? is the only zero of the polynomial 
p(y) = y, + iy2. If Q is open in R?, and if u € JQ) is a distribution solu- 
tion of the Cauchy-Riemann equation 


Theorem 8.12 implies that ue C*%(Q). It follows that u is a holomorphic 
function of z = x, + ix, in Q. In other words, every holomorphic distribution 
is a holomorphic function. 


Exercises 


1. The following simple properties of holomorphic functions of several variables 
were tacitly used in this chapter. Prove them. 
(a) If fis entire in ¢”, if we ¢", and if @(A) = f(Aw), then @ is an entire function 
of one complex variable. 
(b) If P is a polynomial in ¢” and if 


| 1P\do,=0 
Tn 


then P is identically 0. Hint: Compute |, | P|? do,. 
(c) If P is a polynomial (not identically 0) and g is an entire function in ¢", then 
there is at most one entire function f that satisfies Pf = g. 
Find generalizations of these three properties. 
2. Prove the statement about convex hulls made in the last sentence of the proof of 
Theorem 8.4. 


3. 


4. 
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Find a fundamental solution for the operator 07/dx, 0x, in R*. (There is one 
that is the characteristic function of a certain subset of R?.) 
Show that the equation 


Ou Pu ; 
Ox? 0x2 


is satisfied (in the distribution sense) by every locally integrable function u of the 
form 


U(X, X2) = f(x, + x2) or u(xX4, X2) = f(x, — x) 


and that even classical solutions (i.e., twice continuously differentiable functions) 
need not be in C®. Note the contrast between this and the Laplace equation. 


. For x € R®, define f(x) = (1 + |x|?)~!. Show that fe 2(R°) and that f is a fun- 


damental solution of the operator I — A in R*. Find f, by direct computation 
and also by the following reasoning: 
(a) Since f is a radial function (i.e., one that depends only on the distance from 
the origin) the same is true of f; see Exercise 1 of Chapter 7. 
(b) Away from the origin, (J — A) f = 0, and fe C®. 
(c) If F(|y|) =f(y), (b) implies that F satisfies an ordinary differential equation 
in (0, 00) that can easily be solved explicitly. 
Ans. f (y) = (n/2)'? |y|~* exp (—ly]). 
Do the same with R" in place of R°; you will meet Bessel functions. 


. For0 <2< nand x € R’, define 


K ,(x) = |x|74. 
Show that 
(a) K ,(y) = c(n, )K,-(y) (ye R”), 
where 


2n? “FT [(n — A)/2] 
c(n, A) = —————_-.——— 
T'(4/2) 
Suggestion: If n<2A<2n, K, is the sum of an L’-function and an 


L--function. For these A, Equation (a) can be deduced from the homogeneity 
condition 


K (tx) =t~*K,(x) (x € R", t > 0). 


The case 0<2A<n follows from the inversion theorem (for tempered 
distributions). A passage to the limit gives the case 2A = n. The constants c(n, A) 
can be computed from j fo= ) fo, with d(x) = exp (—|x|?/2). 


. Take n > 3 and A = 2 in Exercise 6, and deduce that —c(n, 2)K,_, is a funda- 


mental solution of the Laplacian A in R". For example, if v has compact support 
in R*, show that a solution of Au = v is given by 


1 
u(x) = -Z | bert) dy. 
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8. Identify R* and ¢ (so that z = x, + ix,); put 


0 0 x ig 0 
oC = ——_ — | seine O =< j eer te 
Ox, a 0x4 
Show that the Fourier transform of 1/z (regarded as a tempered distribution) is 
—i/z. Show that this result is equivalent to the Cauchy formula 


dm,(w) 


(2) = | (66\w) [od € AR*)]. 


Ww 


Since 0 log | w| = 1/w and A = 00, deduce that 


(2) = | (A¢\w) log |w—z|dm(w) [ € HR’)]. 


Thus log |z| is a fundamental solution of the Laplacian in R?. 
9. Use Exercise 6 to compute that 


lim [e~1-—b—K,_,y)]=logly| (ye R?), 


6-0 


where b is a certain constant. Show that this leads to another proof of the last 
statement in Exercise 8. 


10. Suppose P(D) = D* + aD + bI. (We are now in the case n = 1.) Let f and g be 
solutions of P(D)u = 0 which satisfy 


f0)=90) and f'(0)—g(0)=1. 
Define 


_ JF) if x <0, 
o> oe ifx > 0, 


and put 


Noite {" Hx)G(x) dx [6 € BRI] 


Prove that A is a fundamental solution of P(D). 


11. Suppose u is a distribution in R" whose first derivatives D,u, ..., D,u are locally 
L?. Prove that u is then locally I’. Hint: If y € D(R") is 1 in a neighborhood of 
the origin and if AE = 6, then A(wE) — 6 € D(R"). Hence 


u— 2: (D,u) * D(WE) 


is in C®(R"). Each D{wE) is an L’-function with compact support. 
12. Suppose u is a distribution in R" whose Laplacian Au is a continuous function. 
Prove that u is then a continuous function. Hint: As in Exercise 11, 


u — (WE) * (Au) € C®(R"). 


13. Prove analogues of Exercises 11 and 12, with R” replaced by an arbitrary open 
set 02. 
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14. Show, under the hypotheses of Exercise 12, that 


15. 


16. 


(a) 67u/Ox? is locally L7, but 
(b) 0*u/Ox? need not be a continuous function. 

Outline of (b) for periodic distributions in R? (Exercise 22, Chapter 7): If 
g € C(T’) has Fourier coefficients g(m, n) and if fis defined by 


f(m, n) = (1 + m? + n?)~'g(m, n), 


then fe C(T*) and Af =f — g € C(T”), since ¥ | f(m, n)| < oo. The Fourier coef- 
ficients of 07f/dx? are —m?f(m,n). If 67f/@x? were continuous for every 
ge C(T’), then (07f/Ax?X0, 0) would be a continuous linear functional of g. 
Hence there would be a complex Borel measure p on T?, with Fourier coeffi- 
cients 


m2 


X(m, n) = ————_——--.. 
Bee) 1 +m? +n? 


The next exercise shows that no such measure exists. 
If 1 is a complex Borel measure on T”, and if 


A B 


1 , 
14, B= aa ap a) =, & fhe 


prove that 


lim im (A, B)| = lim tim (A, | 
A>aw LB-ow B>«o LA~>o 

Suggestion: If D,(t)=(24+1)7' ¥4,e™, then D,(x)=1 if x =0, 
D ,(x) > 0 otherwise, and 


(A, B) = | D 4(x)D gy) du(x, y). 


Conclude that each of the two iterated limits exists and that both are equal to 
p({0, 0}). 

If u were as in Exercise 14, one of the iterated limits would be 1, the 
other 0. 
Suppose L is an elliptic linear operator in some open set 2 < R”, and suppose 
that the order of L is odd. 
(a) Prove that then n = 1 orn = 2. 
(b) If n = 2, prove that the coefficients of the characteristic polynomial of L 

cannot all be real. 

In view of (a), the Cauchy-Riemann operator is not a very typical example 

of an elliptic operator. 


CHAPTER 


TAUBERIAN 
THEORY 


Wiener’s Theorem 


9.1 Introduction A tauberian theorem is one in which the asymptotic 
behavior of a sequence or of a function is deduced from the behavior of 
some of its averages. Tauberian theorems are often converses of fairly 
obvious results, but usually these converses depend on some additional 
assumption, called a tauberian condition. To see an example of this, consider 
the following three properties of a sequence of complex numbers 
Sy = Ag t+’ +4,. 


(a) lim s,=s. 


no 


(b) Tff(r)= y a,r", 0<r< 1, then lim f(r) =s. 


r>1 
(c) lim na, = 0. 
Since f(r) = (1 —r) >} s,r" and (1 —r) > r" = 1, f(”) is, for each r € (0, 1), an 
average of the sequence {s,}. It is extremely easy to prove that (a) implies 


(b). The converse is not true, but (b) and (c) together imply (a); this is also 
quite easy and was proved by Tauber. The tauberian condition (c) can be 
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replaced by the weaker assumption that {na,} is bounded (Littlewood). It is 
remarkable how much more difficult this weakening of (c) makes the proof. 

Wiener’s tauberian theorem deals with bounded measurable functions, 
originally on the real line. If @ € L*(R) and if d(x) > 0 as x > +00, then it 
is almost trivial that (K * @)\(x) > 0 as x > +00 for every K € L’(R). The 
convolutions K « @ may be regarded as averages of @, at least when 
) K = 1. Wiener’s converse [(a) of Theorem 9.7] states that if (K * ¢)(x) > 0 
for one K e€ L'(R) and if the Fourier transform of this K vanishes at no 
point of R, then (f * ¢)(x) > 0 for every fe L(R); the stronger conclusion 
that ¢(x)-— 0 need not hold under these hypotheses, but it does hold if a 
slight additional condition (slow oscillation) is imposed on @ [(b) of 
Theorem 9.7]. 

The unexpected tauberian condition—the nonvanishing of K—enters 
the proof in the following manner: If (K * ¢)(x) — 0, the same ts true if K 1s 
replaced by any of its translates, hence also if K is replaced by any finite 
linear combination g of translates of K. When K has no zero, it turns out 
that the set of these functions g is dense in L' (Theorem 9.5). One is thus led 
to the study of translation-invariant subspaces of L’. 


9.2 Lemma _ Suppose fe L'(R"), te R", and ¢>0. Then there exists 
h € L'(R"), with ||h||, < @, such that 


(1) h(s) = f(t) —f(s) 


for all s in some neighborhood of t. 


The lemma states that fis approximated, in the L’-norm, by a function 
f +h whose Fourier transform is constant in a neighborhood of the point t. 


PROOF. Choose g € L'(R") so that g = 1 in some neighborhood of the 
origin. For 4 > 0, put 


(2) g(x) =e" *A~"g(x/A) (x € R") 
and define 
(3) h,(x) =f (g(x) — (f * gaXt). 


Since g,(s) = 1 in some neighborhood VJ, of t, (3) shows that (1) holds 
for s € V,, with h, in place of h. Next, 


(4) h(x) = | f(y)Le~* g(x) — g(x — y)] dm,(y). 


The absolute value of the expression in brackets is 


(5) [A~"g(A~ tx) — Am "g(A "(x — y)) |. 
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It follows that 


(6) = Wlhalls S | | f(y) | dm,(y) | | (Z) — g(E — A *y)| dm,(¢), 


by the change of variables x = Aé. The inner integral in (6) is at most 
2\\g||,, and it tends to 0 for every y € R", as A > o. Hence |\h,||, — 0, 
as A — oo, by the dominated convergence theorem. ITT 


Theorem If ¢ € L°(R"), Y is a subspace of L'(R"), and 
f*o=0 for every fe Y, 


then the set 


(2) 


Z(Y) = a {se R": f(s) = 0} 


contains the support of the tempered distribution @. 


9.4 


PROOF. Fix a point t in the complement of Z(Y). Then f(t) = 1 for a 
certain fe Y. Lemma 9.2 furnishes h € L'(R"), with ||h||, < 1, such that 
h(s) = 1 — f(s) in some neighborhood V of t. 

To prove the theorem, it suffices to show that d = 0 in V, or, 
equivalently, that é() = 0 for every wy € Y, whose Fourier transform 
y has its support in V. Since 


(3) dh) = OW) = (¢ * WO), 


it suffices to show that @ * yw = 0. 

Fix such a yw. Put g.=W, g, =h* Gm, for m>1. Then 
Gms < ANTI Wily, and since ||h\|, <1, the function G=Y g,, is in 
L'(R"). Since h(s) = 1 — f(s) on the support of pW, we have 


(4) (1 — h(s)(s) = Y(s)f(s) (se R”), 

or 

0 = ¥, ff = Gf 

Thus wy = G « f, and (1) implies 

(6) yx hb=GefxG=0. /[/// 


Wiener’s theorem If Y is a closed translation-invariant subspace of 


L’(R") and if Z(Y) is empty, then Y = L(R’). 


PROOF. To say that Y is translation-invariant means that t, fe Y if 
fe Y and x € R". If ¢ € L°(R") is such that | fd = 0 for every fe Y, 
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the translation-invariance of Y implies that f « @ = 0 for every fe Y. 
By Theorem 9.3, the support of the distribution ¢ is therefore empty, 
hence ¢@ = 0 (Theorem 6.24), and since the Fourier transform maps 
SF, to £ in a one-to-one fashion (Theorem 7.15), it follows that @ = 0 
as a distribution. Hence ¢ is the zero element of L*(R"). 

Thus Y* = {0}. By the Hahn-Banach theorem, this implies that 


Y = L(R’). [/// 


9.5 Theorem Suppose K € L'(R") and Y is the smallest closed trans- 
lation-invariant subspace of L'(R") that contains K. Then Y = L(R") if and 
only if K(t) # 0 for every t € R". 


PRooF. Note that Z(Y) = {t € R": K(t) = 0}. The theorem thus asserts 
that Y = L{(R") if and only if Z(Y) is empty. One-half of this is 
Theorem 9.4; the other half is trivial. //// 


9.6 Definition A function ¢ € L*(R’") is said to be slowly oscillating if to 
every € > O correspond an A < o anda o > 0 such that 


(1) |P(x)-— AY)|<e if |x| >A, |y| > A, |x — yl <6. 


If n = 1, one can also define what it means for @ to be slowly oscil- 
lating at + 00: the requirement (1) is replaced by 


(2) | d(x) — d(y)| <é ifx >A, y>A,|x—y|<o. 


The same definition can of course be made at — oo. 
Note that every uniformly continuous bounded function is slowly 
oscillating but that some slowly oscillating functions are not continuous. 
We now come to Wiener’s tauberian theorem; part (b) was added by 
Pitt. 


9.7 Theorem 


(a) Suppose ¢ € L°(R"), K € L(R"), K(t) ¥ 0 for every t € R", and 


(1) lim (K * $)(x) = aK(0). 
|x| — 00 
Then 
(2) oes (f * p(x) = af (0), 


for every f €¢ '(R"). 
(b) If, in addition, ¢ is slowly oscillating, then 


(3) lim (x) = a. 


|x| — oo 
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PROOF. Put w(x) = ¢(x) — a. Let Y be the set of all fe L'(R") for 
which 


(4) im (f « W\(x) = 0. 


It is clear that Y is a vector space. Also, Y is closed. To see this, 
suppose f; € Y, || f—f;||, 7 0. Since 


(5) Ife —fixWllo SIF -Sfililvlo, 
f.* w—f* Ww uniformly on R"; hence (4) holds. Since 
(6) ((t, f) * Wx) = (t,(F * Wx) = F * Wx — 9), 


Y is translation-invariant. Finally, K € Y, by (1), since K * a = aK(0). 
Theorem 9.5 now applies and shows that Y = L'(R"). Thus every 
f € L(R") satisfies (4), which is the same as (2). This proves part (a). 
If ¢ is slowly oscillating and if ¢ > 0, choose A and 6 as in Defi- 
nition 9.6, and choose fe L'(R") so that f > 0, f(0) = 1, and f(x) = 0 if 
|x| = d. By (2), 


(7) lim (f * )\(x) = a. 


|x| — 00 


Also, 


(8) P(x) — F * Ox) = | 


ly| 


If |x| > A + 6, our choice of A, 6, and f shows that 


Lee) — &(x — y)] f(y) dm,(y). 


(9) | P(x) — (f * P\x)| <.e. 
Now (3) follows from (7) and (9). 
This completes the proof. Tf} 


Remark If n=1, Theorem 9.7 can be modified in an obvious 


fashion, by writing x — + 00 in place of |x| — oo wherever the latter occurs 
and by assuming in (b) that @ is slowly oscillating at + oo. The proof 
remains unchanged. 


The Prime Number Theorem 


9.9 


Introduction For any positive number x, 2(x) denotes the number 


of primes p that satisfy p < x. The prime number theorem is the statement 


that 


(1) 


. mx) log x 
im ——— = 


xa x 


1. 
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We shall prove this by means of a tauberian theorem due to Ingham, based 
on that of Wiener. The idea is to replace the rather irregular function z by a 
function F whose asymptotic behavior is very easily established and to use 
the tauberian theorem to draw a conclusion about z from knowledge of F. 


9.10 Preparation The letter p will now always denote a prime; m and n 
will be positive integers; x will be a positive number; [x] is the integer that 
satisfies x — 1 < [x] < x; the symbol d|n means that d and n/d are positive 
integers. Define 


log p  ifn=p, p’, p’,..., 
1 A(n) = 
) ") ‘o otherwise, 
(2) (x) = d A(n), 


3) F(x) = $ 4(2) 


The following properties of wy and F will be used: 


(4) (x) Z mx) log x 1 ‘ W(x) log x 

x x log x x log (x/log* x) 
if x > e, and 
(5) F(x) = x log x — x + b(x) log x, 


where b(x) remains bounded as x > oo. 
By (4), the prime number theorem is a consequence of the relation 


(6) ia ay 
x 


x 0 


which will be proved from (3) and (5) by a tauberian theorem. 


PROOF OF (4). [log x/log p] is the number of powers of p that do not 
exceed x. Hence 


log 
W(x)= > oe *) toe p < >. log x = n(x) log x. 
psx I og p<x 
This gives the first inequality in (4). If 1 < y < x, then 
log Pp __ Wx) 
mx)-mMyy= L) 1S Do —— 
y<p<x _y<p<x logy * logy’ 


Hence x(x) < y + W(x)/log y. With y = x/log’ x, this gives the second 
half of (4). Il 
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PROOF OF (5). If n > 1, then 


ro-rm- ne 5 {)-a P=) 


The mth summand is 0 except when n/m is an integer, in which case it 
is A(n/m). Hence 


F(n) — F(n — 1) = 5a(*) =) A(d) = log n. 


m|n d|n 


The last equality depends on the factorization of n into a product of 
powers of distinct primes. Since F(1) = 0, we have computed that 


(7) F(n)= > logm=log(n!) (n=1, 2, 3,...), 

m=1 
which suggests comparison of F(x) with the integral 
(8) J¢x) = | Yop t dt =x log x — x +1 

1 

Ifn<x<n+1 then 
(9) J(n) < F(n) < F(x) < F(n + 1) < J(n + 2) 
so that 
(10) | F(x) — J(x)| < 2 log (x + 2). 
Now (5) follows from (8) and (10). //// 


9.11 The Riemann zeta function As is the custom in analytic number 
theory, complex variables will now be written in the form s = o + it. In the 
half-plane o > 1, the zeta function is defined by the series 


(1) ((s)= Din’ 
n=1 
Since |n *| =n 7%, the series converges uniformly on every compact subset 


of this half-plane, and ¢ is holomorphic there. 
A simple computation gives 


Nt+1 N n+1 N 
| [xx *:dx Ss | x '"Sdx = Yn S—N(N +1). 
1 n=1 n n=1 
When o > 1, N(N + 1)" 0 as N > o. Hence 


(2) ((s)=s | bart dx  (a>1). 
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If b(x) = [x] — x, it follows from (2) that 


(3) C(s) = —— +s | b(x)x~*~§ dx (o > 1). 

1 
Since b is bounded, the last integral defines a holomorphic function in the 
half-plane o > 0. Thus (3) furnishes an analytic continuation of € to o > 0, 
which is holomorphic except for a simple pole at s = 1, with residue 1. The 
most important property we shall need is that ¢ has no zeros on the line 
o=1: 


(4) ¢(1 + it) #0 (—c0o <t< o). 
The proof of (4) depends on the identity 
(5) (s)=]]G-p)* (>). 


Since (1—p-‘)"'=1+p *%+p 75+4+-°-:, the fact that the product (5) 
equals the series (1) is an immediate consequence of the fact that every 
positive integer has a unique factorization into a product of powers of 
primes. Since )| p~° < 00 if o > 1, (5) shows that ¢(s) # 0 if o > 1 and that 


6 log (3) = Yi mtp-™ =~ (o > 1). 


Pp m=1 


Fix a real t # 0. If o > 1, (6) implies that 
(7) log |C°(a)C*(a + it)t(o + 2it)| 
= y, m—*p-™e Re {3 te 4p-‘™' + po imty > 0, 
p,m 


because Re (6 + 8e® + 2e7") = (e”/* + e /*)* > 0 for all real 6. Hence 


C(o + it)|* 


1 
|C(o + 2it)| >——_. 
o—l ao—l 


(8) l(a — 1)&(a)|° 


If ¢(1 + it) were 0, the left side of (8) would converge to a limit, 
namely, | ¢’(1 + it)|*|C(1 + 2it)|, as o decreases to 1. Since the right side of 
(8) tends to infinity, this is impossible, and (4) is proved. 


9.12 Ingham’s tauberian theorem Suppose g is a real nondecreasing 
function on (0, 00), g(x) = Oifx < 1, 


(1) G(x) = 5 o(*) (0 <x < 0), 


and 


(2) G(x) = ax log x + bx + xe(x), 
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where a, b are constants and &x) > 0 as x > oo. Then 
(3) lim x7 4g(x) = a. 


x oO 


If g is the function w defined in Section 9.10, Ingham’s theorem 
implies, in view of Equations (3) and (5) of Section 9.10, that (6) of Section 
9.10 holds, and this, as we saw there, gives the prime number theorem. 


PROOF. We first show that x~‘g(x) is bounded. Since g is nonde- 


creasing, 
g(x) — (3 *)< by (—1)"**g (*) - G(x) — 26%) 
= of log 2 + &x) — (=) < Ax, 


where A is some constant. Since 


g(x) = g(x) — (5) + (5) = (3) pee, 


it follows that 


(4) go) < A(x E+E 4 /) = 24x 


We now make a change of variables that will enable us to use 
Fourier transforms in a familiar setting. For — 0 < x < oo, define 


(5 A(x) = gle, HO) = Y A(x — log 0). 


n=1 
Then h(x) = 0 if x < 0, H(x) = G(e*); hence (2) becomes 
(6) H(x) = e*(ax + b + &,(x)) 
where ¢,(x) > 0as x > o. If 
(7) P(x) =e *h(x) (-wW<x< o), 
then ¢ is bounded, by (4). We have to prove that 
(8) lim $(x) = a. 


Put k(x) = [e*]Je ~, let A be a positive irrational number, and 
define 


(9) K(x) = 2k(x) — K(x —1)—Kx—d) (-w <x < ow), 
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Then K ¢€ L'(— 00, «); in fact, e*K(x) is bounded. (See Exercise 8.) If 
s=o+ it,o > 0, then formula (2) of Section 9.11 shows that 


00 00 00 1 
| k(x)e~ 5 dx = | [e*Je~ s+ 1) dx = | [y]y 275 dy = C( + s) 
= 0 1 l+s 


Repeat this with k(x — 1) and k(x — A) in place of k(x), use (9), and 
then let co > 0. The result is 


. — itx sx At — iat c(1 + it) 

(10) | eve dx =(2—e e '*') ae 
Since ¢(1 + it) 40 and since A is irrational, K(t) # 0 if t #0. Since ¢ 
has a pole with residue 1 at s = 1, the right side of (10) tends to1 +4 
as t 0. Thus K(0) # 0. 

To apply Wiener’s theorem, we have to estimate K * ¢. To 
do this, put u(x) = [e*], let v be the characteristic function of [0, 00), 
and let uw be the measure that assigns mass 1 to each point of the set 
{logn: n=1, 2, 3, ...} and whose support is this set. By (5), 
H =h» u. Also, u = v * pw. Hence 


(1) (hawe)ahev e WO) =(H +) = | “H(y) dy. 


(Note that we now take convolutions with respect to Lebesgue 
measure, not with respect to the normalized measure m,.) Since 


(p * kx) = . e” *h(x — y)Le”Je"” dy =e *(h * ul(x), 


— @ 


(6) and (11) imply that 


(12) (je x)= [0 ee eine eer rar tea 


where ¢,(x) > 0 as x > o. By (12) and (9), 


(13) lim (K * ¢\(x)=(1+ Aja=a [’ K(y) dy. 


xo — 0 


Therefore Wiener’s theorem 9.7 (see also Remark 9.8) implies that 


(14) lim (f * (x) =a | f(y) dy 
for every f € L'(— co, oo). 

Let f, and f, be nonnegative functions whose integral is 1 and 
whose supports lie in [0, ¢] and [ —e, 0], respectively. By (7), e*@(x) is 
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nondecreasing. Thus ¢@(y) < e’d(x) if x-—ex<y<x, and @(y)= 
e °d(x)ifx < y< x + «. Consequently, 


(15) ef, * PX) < P(x) < e(f2 * PY). 


It follows from (14) and (15) that the upper and lower limits of (x), as 
x — 00, lie between ae * and ae’®. Since ¢ > 0 was arbitrary, (8) holds, 
and the proof is complete. //// 


The Renewal Equation 


As another application of Wiener’s tauberian theorem we shall now give a 
brief discussion of the behavior of bounded solutions @ of the integral equa- 
tion 


#(x) — | 6% — 1) dult) = f(%) 


which occurs in probability theory. Here p is a given Borel probability 
measure, f is a given function, and @ is assumed to be a bounded Borel 
function, so that the integral exists for every x € R. The equation can be 
written in the form 


o—o*u=f, 
for brevity. 
We begin with a uniqueness theorem. 


9.13. Theorem [If u is a Borel probability measure on R whose support 
does not lie in any cyclic subgroup of R, and if ¢ is a bounded Borel function 
that satisfies the homogeneous equation 


(1) P(x) — (b * ux) = 0 


for every x € R, then there is a constant A such that (x) = A except possibly 
in a set of Lebesgue measure 0. 


PROOF. Since p is a probability measure, fi(0) = 1. Suppose that f(t) = 
1 for some t # 0. Since 


(2) f(t) = | e~'™ du(x), 
it follows that 4 must be concentrated on the set of all x at which 
e~'** = 1, that is, on the set of all integral multiples of 27/t. But this is 
ruled out by the hypothesis of the theorem. 

If o =0— yp, where 6 is the Dirac measure, then 6 = 1 — £. 
Hence a(t) = 0 if and only if t = 0, and (1) can be written in the form 


(3) o+a=0. 
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Put g(x) = exp (—x’); put K =g*o. Then Ke L, K(t)=0 
only if t = 0, and (3) shows that K * ¢@ = 0. By Theorem 9.3 (with the 
one-dimensional space generated by K in place of Y) the distribution 
@ has its support in {0}. Hence ¢ is a finite linear combination of 6 
and its derivatives (Theorem 6.25), so that ¢@ is a polynomial, in the 
distribution sense. Since nonconstant polynomials are not bounded 
on R, and since ¢ is assumed to be bounded, we have reached the 
desired conclusion. //// 


9.14 Convolutions of measures If » and A are complex Borel measures 
on R", then 


(1) ae | : I(x + y) du(x) daly) 


is a bounded linear functional on C,(R"), the space of all continuous func- 
tions on R” that vanish at infinity. By the Riesz representation theorem, 
there is a unique Borel measure p * A on R” that satisfies 


(2) | f d(u * 4) = | \ f(x + y) du(x) day) = [fe Co(R")]. 


A standard approximation argument shows that (2) then holds also for 
every bounded Borel function f In particular, we see that 


(3) (u* A)’ = pi. 


Two other consequences of (2) will be used in the next theorem. One 
is the almost obvious inequality 


(4) le * All < [all All, 


where the norm denotes total variation. The other is the fact that p * / is 
absolutely continuous (relative to Lebesgue measure m,) if this is true of y; 
for in that case, 


(5) | F(x + y) du(x) = 0 


for every y € R", if f is the characteristic function of a Borel set E with 
m,(E) = 0, and (2) shows that (uw * A) E) = 0. 

Recall that every complex Borel measure p has a unique Lebesgue 
decomposition 


(6) = Uy + Us, 


where py, is absolutely continuous relative to m, and p, is singular. 
The next theorem is due to Karlin. 
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9.15 Theorem Suppose pis a Borel probability measure on R, such that 


(1) 
(2) 


(3) 


LU, # 9, 


" Errore 


— @ 


M=| x du(x) 4 0. 


Suppose that fe L'(R), that f(x) >~0 as x > +0, and that @ is a 


bounded function that satisfies 


(4) 


P(x) — (b * WX) =f(x) (“OW <x < ow). 


Then the limits 


(6) (co) = lim 4%), G{—o0) = tim (9 
exist, and 
6) $1c0) — 20) = 32 |” sf dy 


PROOF. Put o = 6 — yp, as in the proof of Theorem 9.13. Define 


— p((— 00, x)) ifx <0, 


7) a Gao ns 00) if x > 0. 


The assumption (2) guarantees that K € L’(R). A straightforward com- 
putation, whose details we omit, shows that 


e int 7. . )@(O)/it if t #0 
(8) | K(xJe ix = {*" ie 


— 0c 


and that 
(9) [ 100) dx = (& # 48) —(K + on (—o <r<s< 0), 


sincef +o * a. 

By (1), w is not singular. The argument used at the beginning of 
the proof of Theorem 9.13 shows therefore that a(t) #0 if t #0. 
Hence (8) and (3) imply that K has no zero in R. 

Since f € L'(R), (9) implies that K * ¢ has limits at +00, whose 
difference is [°,, f. 

We shall show that @¢ is slowly oscillating. Once this is done, (5) 
and (6) follow from the properties of K and K * @ that we just 
proved, by Pitt’s theorem (b) of 9.7. 
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Repeated substitution of ¢ = f+ @¢ * yu into its right-hand side 
gives 
(10) P=ftfeutot+few + pep 
=fit nth, (n = 2, 3, 4, ..,), 
where wi =p, pw" =pe pw" ',f.=f+---+f* yw" 3, and 
(11) Gn=P*(U')a, hy= * (U’),. 


For each n, f(x) > 0 as x > +o, and g, is uniformly contin- 
uous. Hence f, +g, is slowly oscillating. Since the total variations 


satisfy 

(12) MH")sll S Mus)" S Well”, 

we have 

(13) [h,Ax)| < lll: lus" = (— 00 < x < oo), 


where ||¢|| is the supremum of |¢@| on R. By (1), ||y,|| < 1. Hence 
h, 20, uniformly on R. Consequently, @ is the uniform limit of the 
slowly oscillating functions f, + g,. This implies that ¢ is slowly oscil- 
lating, and completes the proof. //// 


Exercises 


1. Prove the theorem of Tauber stated in Section 9.1. 
2. Suppose ¢ € L*(R") and the support of the distribution ¢ consists of k distinct 


points s,,..., s,. Construct suitable functions y,, ..., w, such that (@ * y,)* has 
the singleton (s;} as support, and conclude that ¢ is a trigonometric polynomial, 
namely, 


g(x) = a,e'*+---+a,ek"* (ae). 


(The case k = 1 is done in the proof of Theorem 9.13.) 

3. Suppose Y is a closed translation-invariant subspace of L’(R") such that Z(Y) 
consists of k distinct points. (The notation is as in Theorem 9.3.) Use Exercise 2 
to prove that Y has codimension k in L’(R"), and conclude from this that Y 
consists of exactly those f € L'(R") whose Fourier transforms are 0 at every point 
of Z(Y). 

4. Prove the following analogue of (a) of Theorem 9.7: If @ € L°(R"), and if 
to every te R" corresponds a function K,¢ L(R") such that K,(t) #0 and 
(K, * \(x) > 0 as |x] — oo, then (f * @)\(x) > 0 as |x| — 00, for every fe L'(R"). 

5. Assume K e€ L'(R") and K has at least one zero in R". Show that then there 
exists @ € L°(R") such that (K * @)(x) = 0 for every x € R”, although @ does not 
satisfy the conclusion of (a) of Theorem 9.7. 

6. If d(x) = sin (x*), —00 < x < oo, show that 


lim (f * x) =0 


|x| 
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for every f € L'(R), although the conclusion of (b) of Theorem 9.7 does not hold. 
7. For « > 0, let f, be the characteristic function of the interval [0, «]. Define f, in 
the same way; put g=/,+/f,. Prove that the set of all finite linear com- 
binations of translates of g is dense in L’(R) if and only if B/a is irrational. 
8. If « > 0 and ax = 1, prove that 


1—a<ao[x] <1, 


and deduce from this that e*K(x) is bounded, as asserted in the proof of 
Theorem 9.12. 

9. Let Q denote the set of all rational numbers. Let y be a probability measure on 
R that is concentrated on Q, and let ¢ be the characteristic function of Q. Show 
that o(x) = (@ * p(x) for every x E R, although @ is not constant. (Compare 
with Theorem 9.13.) What other sets could be used in place of Q to achieve the 
same effect? 

10. Special cases of the following facts were used in Theorem 9.15. Prove them. 
(a) If ¢ € L*(R") and k € L'(R"), then k « ¢ is uniformly continuous. 
(b) If {@,} is a sequence of slowly oscillating functions on R" that converges 
uniformly to a function @, then @ is slowly oscillating. 
(c) If 4 and A are complex Borel measures on R”, then 


(ue * Adsl < [sll (Asi. 
11. Put (x) = cos (|x|'/%) and define 


l 1 
f(x) = we) 3 | Woe ydy — (—w <x <@). 
-1 
Prove that fe (L' ~ C,\R) but that no bounded solution of the equation 
1 1 
o(x) — 5 | oe — y) dy = f(x) 
-1 
has limits at +00 or at —oo. (This illustrates the relevance of the condition 
M # Oin Theorem 9.15.) 
12. Let pp be a probability measure concentrated on the integers. Prove that every 
function @ on R which is periodic with period | satisfies ¢ — @ * wp = 0. (This 1s 


relevant to Theorems 9.13 and 9.15.) 
13. Assume ¢ € L°(0, 00), 


ee d 2 d 
[1x91 < a, [“1H091 S < co, 
0 x 0 x 

- _, ax 
| K(x)x""* — #0 for —o <t< o, 
6 x 


and 


lim | . K(*)o0o cic) 
x—7o JO u u 


14. 


15. 
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lim | H(= ou a 
x7o J0 u u 


This is an analogue of (a) of Theorem 9.7. How would “slowly 
oscillating” have to be defined to obtain the corresponding analog of (b) of 
Theorem 9.7? 

Complete the details in the following outline of Wiener’s proof of Littlewood’s 
theorem. Assume |na,|<1, f(r)= > 9 a,r", and f(r)>-0 as rl. If 
S, =A) +--+ + 4,, it is to be proved that s, > 0 as n— oo. 

(a) |s, —f(1 — 1/n)| < 2. Hence {s,} is bounded. 

(b) If d(x) = s, on [n,n + 1) and 0 <x < y, then 


Prove that 


qe 
1d») — 6091 <2 —. 


(c) \" xe “(t) dt = f(e *) > 0 as x > 0. Hence 
) 


lim | “K(=)ou a? 
x7oa JO u u 


K(x) = (=) exp ( — 5) 


a 
(d) | K(x)x7#* = = (1 + it) 4 0 if tis real. 
0 x 
(e) Put H(x) = 1/(ex) if (1 + 6)” * <x < 1, H(x) = 0 otherwise. Conclude that 


1 (1 +e)x 
lim — | o(y) dy = 0. 
xo Ex x 


(f) By (b) and (e), lim (x) = 0. 
Note: Ifna, — 0 is assumed to hold, then a modification of step (a) is all that is 
needed for the proof. 
Let Y be a closed subspace of L’(R"). Prove that Y is translation-invariant if and 
only if f * g € Y whenever fe Y and g € L'(R’). 

The closed translation invariant subspaces of [’(R") are thus exactly the 
same as the closed ideals in the convolution algebra L'(R"). 
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Introduction 


10.1. Definition A complex algebra is a vector space A over the complex 
field ¢ in which a multiplication is defined that satisfies 


(1) X(yZ) = (xy)z, 

(2) (x + y)zZ = xz + yz, xX(y + Zz) = xy + xz, 
and 

(3) a(xy) = (ax)y = x(ay) 


for all x, y, and z in A and for all scalars «a. 
If, in addition, A is a Banach space with respect to a norm that 
satisfies the multiplicative inequality 


(4) xy < Ixilyll (eA, ye A) 
and if A contains a unit element e such that 

(5) xe=ex =x (x € A) 

and 

(6) llell = 1, 


then A 1s called a Banach algebra. 
Note that we have not required that A be commutative, ie., that 
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xy = yx for all x and y in A, and we shall not do so except when explicitly 
stated. 

It is clear that there is at most one e € A that satisfies (5), for if e’ also 
satisfies (5), then e’ = e’e =e. 

The presence of a unit is very often omitted from the definition of a 
Banach algebra. However, when there is a unit it makes sense to talk about 
inverses, so that the spectrum of an element of A can be defined in a more 
natural way than is otherwise possible. This leads to a more intuitive devel- 
opment of the basic theory. Moreover, the resulting loss of generality is 
small, because many naturally occurring Banach algebras have a unit, and 
because the others can be supplied with one in the following canonical 
fashion. 

Suppose A satisfies conditions (1) to (4), but A has no unit element. 
Let A, consist of all ordered pairs (x, «), where x € A and ae @. Define the 
vector space operations in A, componentwise, define multiplication in A, 
by 


(7) (x, ay, B) = (xy + ay + Bx, «B), 
and define 
(8) I(x, «| = |x| +loa],  e=(, 1). 


Then A, satisfies properties (1) to (6), and the mapping x > (x, 0) is an 
isometric isomorphism of A onto a subspace of A, (in fact, onto a closed 
two-sided ideal of A,) whose codimension is 1. If x is identified with (x, 0), 
then A, is simply A plus the one-dimensional vector space generated by e. 
See Examples 10.3(d) and 11.13(e). 

The inequality (4) makes multiplication a continuous operation in A. 
This means that if x,->x and y,— y then x, y, ~ xy, which follows from 
the identity 


(9) Xn Vn — XY = (X_ — X)Vn + X(Vn — Y)- 
In particular, multiplication 1s left-continuous and right-continuous: 
(10) X,Y xy and X), > Xy 


ifx, x and y, > y. 

It is interesting that (4) can be replaced by the (apparently) weaker 
requirement (10) and that (6) can be dropped without enlarging the class of 
algebras under consideration. 


10.2. Theorem Assume that A is a Banach space as well as a complex 
algebra with unit element e 4 0, in which multiplication is left-continuous and 
right-continuous. Then there is a norm on A which induces the same topology 
as the given one and which makes A into a Banach algebra. 


(The assumption e # 0 rules out the uninteresting case A = {0}.) 
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PROOF. Assign to each xé€A _ the left-multiplication operator M, 
defined by 


(1) M (z) = xz (z € A). 


Let A be the set of all M,. Since right multiplication is assumed to be 
continuous, A < @(A), the Banach space of all bounded linear oper- 
ators on A. 

It is clear that x + M, is linear. The associative law implies that 
M,, = M,.M,.Ifx e€ A, then 


(2) |x|] = xell = || M, el] < |M,I llell. 


These facts can be summarized by saying that x > M, is an iso- 
morphism of A onto the algebra A, whose inverse is continuous. Since 


(3) IM, Myll <M, My and ||M,|| = IZ] = 1, 


A is a Banach algebra, provided it is complete, i.e., provided it is a 
closed subspace of @(A), relative to the topology given by the operator 
norm. (See Theorem 4.1.) Once this is done, the open mapping 
theorem implies that x > M,, is also continuous. Hence ||x|| and || M,|| 
are equivalent norms on A. 

Suppose T € @(A), T; € A, and T;—> T in the topology of A&A). 
If T; is left multiplication by x; € A, then 


(4) Ty) = xiv = (x:e)y = Tye)y. 


As i— oo, the first term in (4) tends to T(y), and Te) — T(e). Since 
multiplication is assumed to be left-continuous in A, it follows that 
the last term of (4) tends to T(e)y. Put x = T(e). Then 


(5) T(y) = T(e)y=xy=My) (ye A), 
so that T = M, € A, and A is closed. //// 


The theorem says, in particular, that, in the presence of completeness, 
left continuity plus right continuity implies “joint” continuity. Exercise 6 
shows that this may fail in normed linear algebras that are not complete. 


10.3. Examples (a) Let C(K) be the Banach space of all complex contin- 
uous functions on a nonempty compact Hausdorff space K, with the 
supremum norm. Define multiplication in the usual way: (fg)\(p) =f (p)g(p). 
This makes C(K) into a commutative Banach algebra; the constant function 
1 is the unit element. 

If K is a finite set, consisting of, say, n points, then C(K) is simply @”, 
with coordinatewise multiplication. 
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In particular, when n= 1, we obtain the simplest Banach algebra, 
namely ¢, with the absolute value as norm. 


(b) Let X be a Banach space. Then &(X), the algebra of all bounded 
linear operators on X, is a Banach algebra, with respect to the usual oper- 
ator norm. The identity operator I is its unit element. If dim X =n < o, 
then @(X) is (isomorphic to) the algebra of all complex n-by-n matrices. If 
dim X > 1, then &(X) is not commutative. (The trivial space X = {0} must 
be excluded.) 

Every closed subalgebra of @(X) that contains I is also a Banach 
algebra. The proof of Theorem 10.2 shows, in fact, that every Banach 
algebra is isomorphic to one of these. 


(c) If K is a nonempty compact subset of ¢, or of ¢", and if A is the 
subalgebra of C(K) that consists of those f € C(K) that are holomorphic in 
the interior of K, then A is complete (relative to the supremum norm) and is 
therefore a Banach algebra. 

When K is the closed unit disc in ¢, then A is called the disc algebra. 


(d) '(R"), with convolution as multiplication, satisfies all requirements 
of Definition 10.1, except that it lacks a unit. One can adjoin one by the 
abstract procedure outlined in Section 10.1 or one can do it more con- 
cretely by enlarging I’(R") to the algebra of all complex Borel measures yu 
on R” of the form 


du =f dm, + dd 


where f € L’(R"), 6 is the Dirac measure on R", and 4 is a scalar. 


(e) Let M(R") be the algebra of all complex Borel measures on R", with 
convolution as multiplication, normed by the total variation. This is a com- 
mutative Banach algebra, with unit 6, which contains (d) as a closed sub- 
algebra. 


10.4 Remarks There are several reasons for restricting our attention to 
Banach algebras over the complex field, although real Banach algebras 
(whose definition should be obvious) have also been studied. 

One reason is that certain elementary facts about holomorphic func- 
tions play an important role in the foundations of the subject. This may be 
observed in Theorems 10.9 and 10.13 and becomes even more obvious in 
the symbolic calculus. 

Another reason—one whose implications are not quite so obvious—is 
that ¢ has a natural nontrivial involution (see Definition 11.14), namely, 
conjugation, and that many of the deeper properties of certain types of 
Banach algebras depend on the presence of an involution. (For the same 
reason, the theory of complex Hilbert spaces is richer than that of real 
ones.) 
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At one point (Theorem 10.34) a topological difference between ¢ and 
R will even play a role. 

Among the important mappings from one Banach algebra into 
another are the homomorphisms. These are linear mappings h that are also 
multiplicative : 


h(xy) = h(x)h(y). 
Of particular interest is the case in which the range is the simplest of all 
Banach algebras, namely, ¢ itself. Many of the significant features of the 
commutative theory depend crucially on a sufficient supply of homo- 
morphisms onto ¢. 
Complex Homomorphisms 


10.5 Definition Suppose A is a complex algebra and @ is a linear func- 
tional on A which is not identically 0. If 


(1) P(xY) = P(x) P(y) 


for all x € A and y € A, then @ is galled a complex homomorphism on A. 
(The exclusion of ¢ = 0 is, of course, just a matter of convenience.) 
An element x € A is said to be invertible if it has an inverse in A, that 
is, if there exists an element x1 € A such that 


(2) x tx =xx7! =e, 


where e is the unit element of A. 
Note that no x € A has more than one inverse, for if yx = e = xz then 


y = ye = (xz) = (yx)z = ez = z. 
10.6 Proposition If ¢ is a complex homomorphism on a complex algebra 
A with unit e, then o(e) = 1, and (x) 4 0 for every invertible x € A. 


PROOF. For some y € A, $(y) # 0. Since 
Py) = (ye) = P(y)Ple), 
it follows that d(e) = 1. If x is invertible, then 
P(x)O(x~*) = P(xx™ ") = Ge) = 1, 
so that d(x) # 0. Hf 


Parts (a) and (c) of the following theorem are perhaps the most widely 
used facts in the theory of Banach algebras; in particular, (c) implies that all 
complex homomorphisms of Banach algebras are continuous. 


10.7. Theorem Suppose A is a Banach algebra, x € A, ||x|| <1. Then 
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(a) e—x is invertible, 
ele 
1 — |x| 


(c) |(x)| < 1 for every complex homomorphism ¢ on A. 


(b) |(e—x)* -e-xI< 


PROOF. Since ||x”|| < ||x||" and ||x|| < 1, the elements 
(1) S, =O tx+tx74---4x" 


form a Cauchy sequence in A. Since A is complete, there exists s € A 
such that s, > s. Since x" > 0 and 


(2) s,°(@—x) =e—x"** =(e—x)'s,, 


the continuity of multiplication implies that s is the inverse of e — x. 
Next, (1) shows that 


_lixl? 
1 — |x| 


Finally, suppose A € €,|A| > 1. By (a), e — A” ‘x is invertible. By 
Proposition 10.6, 


ee) 
|s—e— xl = x? +23 4 -- < Yel = 
n=2 


1—A~*(x) = d(e — A~*x) £0. 
Hence ¢(x) # 4. This completes the proof. //// 


We now interrupt the main line of development and insert a theorem 
which shows, for Banach algebras, that Proposition 10.6 actually character- 
izes the complex homomorphisms among the linear functionals. This strik- 
ing result has apparently found no interesting applications as yet. 


10.8 Lemma _ Suppose f is an entire function of one complex variable, 
f (0) = 1, f'(0) = 0, and 


(1) 0<|fAl sel (eg). 
Then f(A) = 1 for all A € €. 


PROOF. Since f has no zero, there is an entire function g such that 
f=exp {g}, g(0) = g'(0)=0, and Re [g(A)] <|A|. This inequality 
implies 

(2) IAA} <|2r—g(A)| (JA Sr). 

The function 


r?g(A) 


(3) h,(4) = pr —gAl 
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is holomorphic in {A:|2| < 2r}, and |h,(A)| <1 if |A| =r. By the 
maximum modulus theorem, 

(4) [AMAL <1 (fal <7). 
Fix A and let r > oo. Then (3) and (4) imply that g(A) = 0. //// 
10.9 Theorem (Gleason, Kahane, Zelazko) If ¢ is a linear functional 


on a Banach algebra A, such that $(e) = 1 and (x) # 0 for every invertible 
x € A, then 


(1) O(xy) = O(x)P(y) (x € A, ye A). 


Note that the continuity of ¢ is not part of the hypothesis. 


PROOF. Let N be the null space of @. If x € A and y € A, the assump- 
tion o(e) = 1 shows that 


(2) x=at G(xje, ytb+ Pye, 


where ae N, be N. If ¢ is applied to the product of the equations (2), 
one obtains 


(3) P(xy) = P(ab) + O(x)O(y). 

The desired conclusion (1) is therefore equivalent to the assertion that 

(4) abe N ifae N and beN. 
Suppose we had proved a special case of (4), namely, 

(5) a*~eN ifae N. 

Then (3), with x = y, implies 

(6) $(x*) =[O(x)]* (x € A). 

Replacement of x by x + y in (6) results in 

(7) P(xy + YX) = 29(x)P(y) (x E A, ye A). 

Hence 

(8) xy+yxeN ifxeN, yea. 


Consider the identity 


(9) (xy — yx)? + (xy + yx)? = 2[x(yxy) + (yxy)x]. 


If x e N, the right side of (9) is in N, by (8), and so is (xy + yx)’, by (8) 
and (6). Hence (xy — yx)” is in N, and another application of (6) yields 


(10) xy—yzeEN ifxeN, yeA. 
Addition of (8) and (10) gives (4), hence (1). 
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Thus (5) implies (1), for purely algebraic reasons. The proof of (5) 
uses analytic methods. 

By hypothesis, N contains no invertible element of A. Thus 
|e — x|| > 1 for every x € N, by (a) of Theorem 10.7. Hence 


(11) [Ae — x|| > |A| =| de — x)| (xe N,A€ QC). 


We conclude that ¢ is a continuous linear functional on A, of norm 1. 
To prove (5), fix ae N, assume |la|| = 1 without loss of gener- 
ality, and define 
= 9a") 
(12) AC) ie 


n=0 


A” (Ae). 


Since | d(a")| < |la"|| < lla" = 1, f is entire and satisfies | f(A)| < 
exp |A| for all A € €. Also, f(0) = d(e) = 1, and f’(0) = (a) = 0. 

If we can prove that f(A) # 0 for every 4 € ¢, Lemma 10.8 will 
imply that f”(0) = 0; hence ¢(a”) = 0, which proves (5). 

The series 


A 
(13) EA) = Y) a" 


converges in the norm of A, for every Ae ¢. The continuity of @ 
shows that 


(14) F(A) = P(E) (Ae ©). 

The functional equation E(A + py) = E(A)E(u) follows from (13) exactly 
as in the scalar case. In particular, 

(15) E(A)E(— 4) = E(O) = e (A€ ¢). 


Hence E(A) is an invertible element of A, for every 4 € ¢. This implies, 
by hypothesis, that @(E(A)) # 0, and therefore f(A) # 0, by (14). This 
completes the proof. py 


Basic Properties of Spectra 


10.10 Definitions Let A be a Banach algebra; let G = G(A) be the set 
of all invertible elements of A. If x €e G and y € G, then y 'x is the inverse 
of x” ty; thus x 'y € G, and G is a group. 


If x € A, the spectrum o(x) of x is the set of all complex numbers 4 


such that Ae — x is not invertible. The complement of o(x) is the resolvent 
set of x; it consists of all A € ¢ for which (Ae — x)~' exists. 


(1) 


The spectral radius of x is the number 


p(x) = sup {|A]: 2 € o(x)}. 
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It is the radius of the smallest closed circular disc in ¢, with center at 0, 
which contains o(x). Of course, (1) makes no sense if o(x) is empty. But this 
never happens, as we shall see. 


10.11 Theorem Suppose A is a Banach algebra, x € G(A), he A, and 
\|hl| < 4x17. Then x + h € G(A), and 


(1) Ie + Ayo * = xo 8 + xT Thx] < Ix * PAI. 


PROOF. Since x +h =x(e +x 'h) and ||x~'h|| <4, Theorem 10.7 
implies that x + h € G(A) and that the norm of the right member of 
the identity 


(x+h)>'—x7'4+x thx t=[(e+x th) '-—e+x th]x7} 
is at most 2||x~ 'h||?||x~ +]. //// 


10.12 Theorem If A is a Banach algebra, then G(A) is an open subset of 
A, and the mapping x > x‘ is a homeomorphism of G(A) onto G(A). 


PROOF. That G(A) is open and that x + x7! is continuous follows 


from Theorem 10.11. Since x ~ x~ + maps G(A) onto G(A) and since it 
is its own inverse, it is a homeomorphism. //// 


10.13. Theorem [If A is a Banach algebra and x € A, then 


(a) the spectrum o(x) of x is compact and nonempty, and 
(b) the spectral radius p(x) of x satisfies 


(1) p(x) = lim |x") = inf |x|" 


n> 00 n>1 


Note that the existence of the limit in (1) 1s part of the conclusion and 
that the inequality 


(2) p(x) < ||xl 


is contained in the spectral radius formula (1). 


PROOF. If |A| > |x|] then e — A~'x lies in G(A), by Theorem 10.7, and 
so does Ae — x. Thus A ¢ o(x). This proves (2). In particular, o(x) is a 
bounded set. 

To prove that o(x) 1s closed, define g: € > A by g(A) = Ae — x. 
Then g is continuous, and the complement Q of o(x) is g~ ‘(G(A)), 
which is open, by Theorem 10.12. Thus o(x) 1s compact. 

Now define f : Q — G(A) by 


(3) f(a) = (le — x)? (A € Q). 
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Replace x by Ae — x and h by (u — Aje in Theorem 10.11. If A € Q and 
wis sufficiently close to A, the result of this substitution is 


(4) If —f™A™+uU-ALAI < 2fHM Plu ap, 
so that 
(5) tim LMI _ pray) ae Q) 


Uva ph — A 


Thus fis a strongly holomorphic A-valued function in Q. 
If |A| > ||x||, the argument used in Theorem 10.7 shows that 


(6) f(A = ee 1x" = Qo-te + QA-*x+-:: 

This series converges uniformly on every circle I, with center at 0 and 
radius r > ||x||. By Theorem 3.29, term-by-term integration 1s there- 
fore legitimate. Hence 


1 
(7) x" = —— | Af(A)dA — (r > ||x|], n =, 1, 2, ...). 
2ni Jr, 


If o(x) were empty, Q would be ¢, and the Cauchy theorem 3.31 
would imply that all integrals in (7) are 0. But when n = 0, the left- 
hand side of (7) is e #0. This contradiction shows that o(x) is not 
empty. 

Since 2 contains all A with |A| > p(x), an application of (3) of 
the Cauchy theorem 3.31 shows that the condition r > ||x|| can be 
replaced in (7) by r > p(x). If 


(8) M(r) = mae If(re")l = > pd), 


the continuity of fimplies that M(r) < oo. Since (7) now gives 


(9) |x"|| <r" * MO), 

we obtain 

(10) lim sup ||x"|"<r — (r > p(x)) 
so that 

(11) lim sup ||x"||*/" < p(x). 


On the other hand, if A € o(x), the factorization 
(12) Ane — x" = (he — xYA" be + pre +x" 1) 
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shows that 4"e—x" is not invertible. Thus 4” € o(x"). By (2), 


|A"| < ||x"|| for n = 1, 2, 3,... . Hence 
(13) p(x) < inf ||x"||"”", 
n>1 
and (1) is an immediate consequence of (11) and (13). //// 


The nonemptiness of o(x) leads to an easy characterization of those 
Banach algebras that are division algebras. 


10.14 Theorem (Gelfand-Mazur) If A is a Banach algebra in which 
every nonzero element is invertible, then A is (isometrically isomorphic to) the 
complex field. 


PROOF. If x € A and A, #A,, then at most one of the elements 
A,e —x and A,e — x is 0; hence at least one of them is invertible. 
Since o(x) is not empty, it follows that o(x) consists of exactly one 
point, say A(x), for each x € A. Since A(x)e — x is not invertible, it is 0. 
Hence x = A(x)e. The mapping x > A(x) is therefore an isomorphism 
of A onto ¢, which is also an isometry, since | A(x)| = ||A(x)e|| = |x| 
for every x € A. 


Theorems 10.13 and 10.14 are among the key results of this chapter. 
Much of the content of Chapters 11 to 13 1s independent of the remainder 
of Chapter 10. 


10.15 Remarks (a) Whether an element of A is or is not invertible in A 
is a purely algebraic property. The spectrum and the spectral radius of an 
x € A are thus defined in terms of the algebraic structure of A, regardless of 
any metric (or topological) considerations. On the other hand, lim ||x"||*” 
depends obviously on metric properties of A. This is one of the remarkable 
features of the spectral radius formula: It asserts the equality of certain 
quantities which arise in entirely different ways. 


(b) Our algebra A may be a subalgebra of a larger Banach algebra B, 
and it may then very well happen that some x € A is not invertible in A but 
is invertible in B. The spectrum of x depends therefore on the algebra. The 
inclusion o,(x) > o,(x) holds (the notation is self-explanatory); the two 
spectra can be different. The spectral radius 1s, however, unaffected by the 
passage from A to B, since the spectral radius formula expresses it in terms 
of metric properties of powers of x, and these are independent of anything 
that happens outside A. 

Theorem 10.18 will describe the relation between o,(x) and o,(x) in 
greater detail. 
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10.16 Lemma _ Suppose V and W are open sets in some topological space 
X, V < W, and W contains no boundary point of V. Then V is a union of 
components of W. 


Recall that a component of W is, by definition, a maximal connected 
subset of W. 


PROOF. Let Q be a component of W that intersects V. Let U be the 
complement of V. Since W contains no boundary point of V, Q is the 
union of the two disjoint open sets Qo V and Qo U. Since Q is 
connected, Q ~ U is empty. ThusQ c V. //// 


10.17 Lemma_ Suppose A is a Banach algebra, x, € G(A) for n = 1, 2, 3, 
..., X is a boundary point of G(A), and x, > x asn— oo. 
Then ||x, "|| + 0 asn—- oo. 


PROOF. If the conclusion is false, there exists M < oo such that 
|x, ‘|| <M for infinitely many n. For one of these, ||x, — x|| < 1/M. 
For this n, 


le — xq XI] = IX" %q — OIL <1, 


so that x, 1x € G(A). Since x = x,(x, 1x) and G(A) is a group, it follows 
that x € G(A). This contradicts the hypothesis, since G(A) is open. //// 


10.18 Theorem 


(a) If A is aclosed subalgebra of a Banach algebra B, and if A contains the 
unit element of B, then G(A) is a union of components of A ~ G(B). 


(b) Under these conditions, if x € A, then a ,(x) is the union of o,(x) and a 
(possibly empty) collection of bounded components of the complement of 
6 ,(x). In particular, the boundary of o ,(x) lies in o,(x). 


PROOF. (a) Every member of A that has an inverse in A has the same 
inverse in B. Thus G(A) c G(B). Both G(A) and A m G(B) are open 
subsets of A. By Lemma 10.16, it 1s sufficient to prove that G(B) con- 
tains no boundary point y of G(A). 

Any such y is the limit of a sequence {x,} in G(A). By Lemma 
10.17, |x, "|| > oo. If y were in G(B), the continuity of inversion in 
G(B) (Theorem 10.12) would force x, ' to converge to y~'. In particu- 
lar {||x,*||} would be bounded. Hence y ¢ G(B), and (a) is proved. 

(b) Let Q, and Q, be the complements of o,(x) and of o,(x), 
relative to ¢. The inclusion Q, <Q, is obvious, since 4 € Q, if and 
only if Ae — x € G(A). Let A, be a boundary point of Q,. Then 
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Ayeé — x is a boundary point of G(A). By (a), A,e — x € G(B). Hence 
Ay € Qz3. Lemma 10.16 implies now that Q, is the union of certain 
components of 0,. The other components of Q, are therefore subsets 
of a ,(x). This proves (b). ae 


Corollaries. Suppose x € Ac B. 


(a) If o,(x) does not separate ¢, i.e., if its complement Q, is connected, 
then o ,(x) = 0,(x). 

(b) If o,(x) is larger than o,(x), then o ,(x) is obtained from o,(x) by 
“filling in some holes” in o;(x). 

(c) Ifo,(x) has empty interior, then o ,(x) = 0,(x). 


The most important application of this corollary occurs when 
6 ,(x) contains only real numbers. 


As another application of Lemma 10.17 we now prove a theorem 
whose conclusion is the same as that of the Gelfand-Mazur theorem, 
although its consequences are not nearly so important. 


10.19 Theorem If A is a Banach algebra and if there exists M < oo such 
that 


(1) llxl Iyll < Mllxyll («€ A, y € A), 


then A is (isometrically isomorphic to) €. 


PROOF. Let y be a boundary point of G(A). Then y = lim y, for some 
sequence {y,} in G(A). By Lemma 10.17, ||y, *|| > 00. By hypothesis, 


(2) yall lyn I< Mile = (n= 1, 2,3,...), 


Hence |ly,,|| ~ 0 and therefore y = 0. 
If x € A, each boundary point / of o(x) gives rise to a boundary 
point Ae — x of G(A). Thus x = de. In other words, A = {Ae: Ae €). 


//// 


It is natural to ask whether the spectra of two elements x and y of A 
are close together, 1n some suitably defined sense, if x and y are close to 
each other. The next theorem gives a very simple answer. 


10.20 Theorem Suppose A is a Banach algebra, x € A, Q is an open set 
in {, and o(x) <Q. Then there exists 6 > 0 such that o(x + y) < Q for every 
y € Awith ||y|| < 6. 
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PROOF. Since ||(Ae — x)~*|| is a continuous function of 4 in the com- 
plement of o(x), and since this norm tends to 0 as 4 — oo, there is a 
number M < oo such that 


(Ae — x)" |< M 
for all 2 outside Q. If y € A, || y|| < 1/M, and A ¢ Q, it follows that 
he — (x + y) = (Ae — x)[e — (Ae — x) *y] 


is invertible in A, since ||(Ae — x)~'y|| <1; hence A ¢ o(x + y). This 
gives the desired conclusion, with 6 = 1/M. //// 


Symbolic Calculus 


10.21 Introduction If x is an element of a Banach algebra A and if 
f(A) = % +°:: +a,4" is a polynomial with complex coefficients «;, there 
can be no doubt about the meaning of the symbol f(x); it obviously denotes 
the element of A defined by 


f(x) =Aetayxte--+a4,x". 


The question arises whether f(x) can be defined in a meaningful way for 
other functions f. We have already encountered some examples of this. For 
instance, during the proof of Theorem 10.9 we came very close to defining 
the exponential function in A. In fact, if f(A) = )° «, 4* is any entire function 
in ¢, it is natural to define f(x) € A by f(x) =). a, x*; this series always 
converges. Another example is given by the meromorphic functions 


1 


1) arr. 


In this case, the natural definition of f(x) is 


F(x) = (we — x)" 


which makes sense for all x whose spectrum does not contain «. 

One is thus led to the conjecture that f(x) should be definable, within 
A, whenever f is holomorphic in an open set that contains o(x). This turns 
out to be correct and can be accomplished by a version of the Cauchy 
formula that converts complex functions defined in open subsets of ¢ to 
A-valued ones defined in certain open subsets of A. (Just as in classical 
analysis, the Cauchy formula is a much more adaptable tool than the power 
series representation.) Moreover, the entities f(x) so defined (see Definition 
10.26) turn out to have interesting properties. The most important of these 
are summarized in Theorems 10.27 to 10.29. 

In certain algebras one can go further. For instance, if x is a bounded 
normal operator on a Hilbert space H, the symbol f(x) can be interpreted 
as a bounded normal operator on H when f is any continuous complex 
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function on o(x), and even when f is any complex bounded Borel function 
on o(x). In Chapter 12 we shall see how this leads to an efficient proof of a 
very general form of the spectral theorem. 


10.22 Integration of A-valued functions If A is a Banach algebra and 
fis a continuous A-valued function on some compact Hausdorff space Q on 
which a complex Borel measure p is defined, then | f du exists and has all 
the properties that were discussed in Chapter 3, simply because A is a 
Banach space. However, an additional property can be added to these and 
will be used in the sequel, namely: If x € A, then 


(1) x | fdp= | sr du(p) 
Q Q 
and 
(2) ( | f au = | f(p)x du(p) 
Q Q 


To prove (1), let M, be left multiplication by x, as in the proof of 
Theorem 10.2, and let A be a bounded linear functional on A. Then AM, is 
a bounded linear functional. Definition 3.26 implies therefore that 


am, | fau= | AM. du=A [on du, 
Q Q Q 


for every A, so that 


Me | fdu= | (M1 de 
Q Q 


which is just another way of writing (1). To prove (2), interpret M, to be 
right multiplication by x. 


10.23. Contours Suppose K is a compact subset of an open c @, and 
I is a collection of finitely many oriented line intervals y,, ..., y, in Q, none 
of which intersects K. In this situation, integration over I is defined by 


(1) [oa dh = d GIA) a. 


It is well known that I can be so chosen that 


1 f da fi iffeK 
(2) Ind (= 55 | = He if ¢.Q 
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and that the Cauchy formula 


1 
(3) f= [a-9-V@ dA 


then holds for every holomorphic function f in Q and for every ¢ € K. See, 
for instance, Theorem 13.5 of [23]. 

We shall describe the situation (2) briefly by saying that the contour T 
surrounds K inQ. 

Note that neither K nor Q nor the union of the intervals y; has been 
assumed to be connected. 


10.24 Lemma Suppose A is a Banach algebra, x € A, a€ C, a ¢ a(x), Q 
is the complement of « in €, and T surrounds o(x) in Q. Then 


(1) se | @— ave — x" dd = (ae — x)" (0 A 22); 


PROOF. Denote the integral by y,. When 4 € a(x), then 
(Ae — x)" * = (we — x) * + (a — Al(ae — x) (Ae — x)7?. 


By Section 10.22, y, is therefore the sum of 


1 
(2) oe =x) 5 | (wap dh =o, 
271 Jr 
since Ind; («) = 0, and 
1 

(3) (ae — x)! - ag [@— arte — x)" dA. 

2ni Jr 
Hence 
(4) (He — X)Vn = Wnt (n = 0, +1, ey ais )e 


This recursion formula shows that (1) follows from the case 
n = 0. We thus have to prove that 


(5) sx | de- 94 dime 


Let I, be a positively oriented circle, centered at 0, with radius 
r> |x||.OnT,,(e—x)~' =>) A" *x". Termwise integration of this 
series gives (5), with I, in place of I. Since the integrand in (5) is a 
holomorphic A-valued function in the complement of a(x) (see the 
proof of Theorem 10.13), and since 


(6) Ind;, (0) = 1 = Ind; (0) 
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for every € € a(x), the Cauchy theorem 3.31 shows that the integral (5) 
is unaffected if C is replaced by ,. This completes the proof. //T/ 


10.25 Theorem Suppose 
(1) R(A) = P(A) + Do Cm, (A — mn) * 
m,k 
is a rational function with poles at the points «,,. [.P is a polynomial, and the 


sum in (1) has only finitely many terms.]| If x € A and if o(x) contains no pole 
of R, define 


(2) R(x) = P(x) + > Cig, y(X — Ome). 


If Q is an open set in ¢ that contains o(x) and in which R is holomorphic, and 
if [ surrounds o(x) in Q, then 


(3) R(x) = = | R(AyAe — x)" da. 
PROOF. Apply Lemma 10.24. //// 


Note that (2) is certainly the most natural definition of a rational func- 
tion of x € A. The conclusion (3) shows that the Cauchy formula achieves 
the same result. This motivates the following definition. 


10.26 Definition Suppose A is a Banach algebra, Q is an open set in €, 
and H(Q) is the algebra of all complex holomorphic functions in 2. By 
Theorem 10.20, 


(1) Ag = {x € A: a(x) < Q} 
is an open subset of A. 


We define (Ag) to be the set of all A-valued functions f, with domain 
Ag, that arise from an f € H(Q) by the formula 


Z 1 
(2) L0)= 5 | f(ayae — x)~* da, 


where I is any contour that surrounds o(x) in Q. 
This definition calls for some comments. 


(a) Since T’ stays away from o(x) and since inversion is continuous in 
A, the integrand is continuous in (2), so that the integral exists and defines 


~ 


f(x) as an element of A. 


(b) The integrand is actually a holomorphic A-valued function in the 
complement of a(x). (This was observed in the proof of Theorem 10.13. See 
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Exercise 3.) The Cauchy theorem 3.31 implies therefore that f(x) is indepen- 
dent of the choice of T, provided only that I surrounds o(x) in Q. 


(c) If x = we and a € Q, (2) becomes 


(3) f (ue) =f (a)e. 


Note that ae € Ag if and only if « e Q. If we identity Ae ¢ with Jee A, 
every f € H(Q) may be regarded as mapping a certain subset of Ag (namely, 
the intersection of Ag with the one-dimensional subspace of A generated by 
e) into A, and then (3) shows that f may be regarded as an extension of f. 

In most treatments of this topic, f(x) is written in place of our f(x). 
The notation f is used here because it avoids certain ambiguities that might 
cause misunderstandings. 

(d) If S is any set and A is any algebra, the collection of all A-valued 
functions on S is an algebra, if scalar multiplication, addition, and multipli- 
cation are defined pointwise. For instance, if u and v map S into A, then 


(uv)(s) = u(s)v(s) (s € S). 
This will be applied to A-valued functions defined in Ag. 


10.27 Theorem Suppose A, H(Q), and H(Ag) are as in Definition 10.26. 
Then H(Ag) is a complex algebra. The mapping f-f is an algebra iso- 
morphism of H(Q) onto H(Ag) which is continuous in the following sense: 

If f, € A(Q) (n = 1, 2, 3, ...) and f, >f uniformly on compact subsets of 
Q), then 


(1) f(x) = lim f(x) (x € Ag). 
If uA) =2 and v(A)=1 in Q, then (x)= x and w(x) =e for every 
x € Ag. 


PROOF. The last sentence follows from Theorem 10.25. The integral 
representation (2) in Section 10.26 makes it obvious that f— fis linear. 
If f = 0, then 


(2) f(we=f(ae)=0 (we Q, 


so that f = 0. Thus f—> fis one-to-one. 

The asserted continuity follows directly from the integral (2) in 
Section 10.26, since ||(Ae — x)~'|| is bounded on I. (Use the same 
for all f,, and apply Theorem 3.29.) 

It remains to be proved that ff is multiplicative. Explicitly, if 
fe H(Q), g € H(Q), and h(A) = f(A)g(A) for all 2 € Q, it has to be shown 
that 


(3) h(x) =f (x)G(x) (x € Ag). 
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If f and g are rational functions without poles in Q, and if h = fg, 
then h(x) = f(x)g(x), and since Theorem 10.25 asserts that R(x) = R(x), 
(3) holds. In the general case, Runge’s theorem (Th. 13.9 of [23]) 
allows us to approximate f and g by rational functions f, and g,,, 
uniformly on compact subsets of Q. Then fg, converges to h in the 
same manner, and (3) follows from the continuity of the mapping 


fo. //// 


Since H(Q) is obviously a commutative algebra, Theorem 10.27 
implies that H(Ag) is also commutative. This may be surprising, because 
f(x) and f(y) need not commute. However, f(x) and g(x) do commute in A, 
for every x € Ag. Hence fg = Gf, by Definition 10.26 (d). 


10.28 Theorem Suppose x € Ag and fe H(Q). 


(a) f(x) is invertible in A if and only if f(A) ¥ 0 for every A € a(x). 
(b)  o( f(x) = f(a). 


Part (b) is called the spectral mapping theorem. 


PROOF. (a) If f has no zero on o(x), then g = 1/f 1s holomorphic in an 
open set Q, such that o(x) <Q, <Q. Since fg = 1 in Q,, Theorem 
10.27 (with Q, in place of Q) shows that f(x)9(x) = e, and thus f(x) is 
invertible. Conversely, if f(«) = 0 for some a € o(x) then there exists 
h € H(Q) such that 


(1) (A—ah(A)=f(4) “Ae Q), 

which implies 

(2) (x — ae)h(x) = f (x) = h(x)(x — ae), 

by Theorem 10.27. Since x — ae is not invertible in A, neither is f(x), 
by (2). 


(b) Fix B € ¢. By definition, B € o( f(x)) if and only if f(x) — Be is 
not invertible in A. By (a), applied to f — B is place of f, this happens if 
and only if f — B has a zero in o(x), that is, if and only if B € f(o(x)). 

//// 


The spectral mapping theorem makes it possible to include composi- 
tion of functions among the operations of the symbolic calculus. 


10.29 Theorem Suppose x € Ag, f € H(Q), Q, is an open set containing 
fS(o(x)), g € HQ,), and h(A) = g(f(A)) in Q,, the set of all AGCQ with 
f(A) € Qy. 
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Then f (x) € Ag, and h(x) = g(f (x)). 
Briefly,h =g -fifh=gof. 


PROOF. By (b) of Theorem 10.28, o( f(x)) ¢ Q,, and therefore 9(f(x)) is 
defined. 

Fix a contour I’, that surrounds f(o(x)) in Q,. There is an open 
set W, with o(x) < W <Q), so small that 


(1) Indr, (fA)=1 Ge W). 


Fix a contour I, that surrounds o(x) in W. If € € T,, then 1/(¢ —f) € 
H(W). Hence Theorem 10.27, with W in place of Q, shows that 


: 1 
(2) [Ce -f(x)]-* = ae { [C—f(a]}*"Ge—x) ida (CET). 


Since I’, surrounds o(f(x)) in Q,, (1) and (2) imply 


2 1 - 
gf (x)) = a [ g(O)le —f(x)]~* dl 
_i 
Oni 


1 
| oa | HOLE — FAY? de (ae — x)-* a 


= = | g(f(A)\Ae — x)" ' da = = | h(A\(Ae — x)~* dd = h(x). 
2ni To 2ni To 
//// 


We shall now give some applications of this symbolic calculus. The 
first one deals with the existence of roots and logarithms. To say that an 
element x € A has an nth root in A means that x = y” for some ye A. If 
x = exp (y) for some y € A, then y is a logarithm of x. 

Note that exp (y) = )/9 y"/n! but that the exponential function can 
also be defined by contour integration, as in Definition 10.26. The contin- 
uity assertion of Theorem 10.27 shows that these definitions coincide (as 
they do for every entire function). 


10.30 Theorem Suppose A is a Banach algebra, x € A, and the spectrum 
a(x) of x does not separate 0 from oo. Then 


(a) x has roots of all orders in A, 
(b) x has a logarithm in A, and 
(c) if > 0, there is a polynomial P such that ||\x~* — P(x)|| < «. 
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Moreover, if o(x) lies in the positive real axis, the roots in (a) can be 
chosen so as to satisfy the same condition. 


PROOF. By hypothesis, 0 lies in the unbounded component of the com- 
plement of o(x). Hence there is a function f, holomorphic in a simply 
connected open set Q > o(x), which satisfies 


exp (f(A)) = 4. 
It follows from Theorem 10.29 that 


exp (f(x) = x, 


so that y = f(x) is a logarithm of x. If 0 < 4 < 00 for every A € a(x), f 
can be chosen so as to be real on o(x), so that o(y) lies in the real axis, 
by the spectral mapping theorem. If z = exp (y/n), then z” = x, and 
another application of the spectral mapping theorem shows that 
a(z) < (0, oo) if o(y) < (— «, 0). This proves (a) and (b); of course (a) 
could have been proved directly, without passing through (5). 

To prove (c), note that 1/A can be approximated by polynomials, 
uniformly on some open set containing o(x) (Runge’s theorem), and 
use the continuity assertion of Theorem 10.27. //// 


These results are not quite trivial even when A is a finite-dimensional 
algebra. For example, it is a special case of (b) that a complex n-by-n matrix 
M is the exponential of some matrix if and only if 0 is not an eigenvalue of 
M, that 1s, if and only if M is invertible. To deduce this from (b), let A be the 
algebra of all complex n-by-n matrices (or the algebra of all bounded linear 
operators on ¢"). 

If some x € A satisfies a polynomial identity, 1e., if P(x) = 0 for some 
polynomial P, then f(x) can always be calculated as a polynomial in x, 
without using the Cauchy integral as in Definition 10.26. If A is finite- 
dimensional, then this remark applies to every x € A. Here are the details: 


10.31 Theorem Let P(A) =(A—a,)" --: (A—a,)™ be a polynomial of 
degree n =m, +--+: +m, and let Q be an open set in € which contains the 
ZerOS X,..., a, Of P. 

If A is a Banach algebra, x € A, and P(x) = 0, then 


(a) a(x) < {a,,..., a,} and 
(b) to every f € H(Q) corresponds a polynomial Q, of degree <n, and a func- 
tion g € H(Q), so that 


(1) f(A) — QA) = PAA) = =UeQ) 
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and 
(2) f (x) = Q(x). 

PROOF. By the spectral mapping theorem, 

(3) P(o(x)) = o(P(x)) = o(0) = {0}. 
This proves (a). 


If all multiplicities m, are 1, Q can be obtained with the aid of 
the Lagrange interpolation formula 


y F(a)P(A) 
i=1 P(a)A — a) 
This gives Q(a;) = f(a;) (1 <i<n); hence (f— Q)/P is holomorphic 
in Q. 

In the general case, the Laurent series of f/P about the points 
Q1,..., 4, give constants c,, so that 


Sa Oe 
~ P(A) i=1 k= frees: 


(4) Q(A) = 


(5) 


is holomorphic in Q. 
This proves (1), and now (2) is a consequence of Theorem 10.27, 
because (1) implies 


(6) f (x) = Q(x) + P(x)G(x), 
and P(x) = 0. [/// 


10.32 Definition Let @(X) be the Banach algebra of all bounded linear 
operators on the Banach space X. The point spectrum o,(T) of an operator 
T € &X) 1s the set of all eigenvalues of T. Thus 4 € o,(T) if and only if the 
null space W(T — AI) of T — AI has positive dimension. 


When A = &(X), the spectral mapping theorem can be refined in the 


following way. 


10.33 Theorem Suppose T € @X), Q is open in ¢, o(T) <Q, and 


fe A(Q). 

(a) Ifx e X,aeQ, and Tx = ax, then f(T)x = f(a)x. 

(b) f(o,{T)) < o,{f(T)). 

(c) Ifaea, f(T)) and f — « does not vanish identically in any component of 
Q, then « € f(a,(T)). 

(d) Iffis not constant in any component of Q, then f(o,(T)) = o,( f(T)). 
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Part (a) states that every eigenvector of T, with eigenvalue «a, is also 
an eigenvector of f(T), with eigenvalue f (cq). 


PROOF. (a) If x = 0 there is nothing to be proved. Assume x # 0 and 
Tx = ax. Then « € o(T), and there exists g € H(Q) such that 


(1) F(A) — f(a) = g(AA — @). 
By Theorem 10.27, (1) implies 
(2) f(T) —f(o)I = G(T\T — al). 


Since (T — al)x = 0, (2) proves (a). 

Thus f(a) is an eigenvalue of f(T) whenever « is an eigenvalue of 
T. It follows that (a) implies (b). 

Under the hypotheses of (c), 


(3) a€a,(f(T)) < o(f(T)) =f(o(T)), 
so that 
(4) f-*(@) 9 o(T) # S. 


Moreover, the set (4) is finite, because o(T) is a compact subset of Q 
and f—«a does not vanish identically in any component of Q. Let 
C1, ..-, ¢, be the zeros of f— a in o(T), counted according to their 
multiplicities. Then 


(5) F(A) — & = GANA — 0) °° A—6,), 
where g € H(Q) and g has no zero on o(T), so that 


(6) F(T) — af = G(T)\T — CI) ++: (7 — C0). 
By (a) of Theorem 10.28, g(T) is invertible in A(X). Since « is an eigen- 
value of f(T), f(T) — al is not one-to-one on X. Hence (6) implies that 
at least one of the operators T — ¢,] must fail to be one-to-one. The 
corresponding ¢; is in o,(T), and since f(¢;) =a the proof of (c) is 
complete. 

Finally, (d) is an immediate consequence of (b) and (c). //// 


The Group of Invertible Elements 


We shall now take a closer look at the structure of G = G(A), the multipli- 
cative group of all invertible elements of a Banach algebra A. 

G, will denote the component of G that contains e, the identity 
element of G. Sometimes G, is called the principal component of G. By the 
definition of component, G, is the union of all connected subsets of G that 
contain e. 
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The group G contains the set 
exp (A) = {exp (x): x € A}, 


the range of the exponential function in A, simply because exp (— x) is the 
inverse of exp (x). In fact, the power series definition of exp (x) yields the 
functional equation 


exp (x + y) = exp (x) exp (y), 


provided that xy = yx; also, exp (0) = e. 
Note also that G is a topological group (see Section 5.12) since multi- 
plication and inversion are continuous in G. 


10.34 Theorem 


(a) G, is an open normal subgroup of G. 

(b) G, is the group generated by exp (A). 

(c) If A is commutative, then G, = exp (A). 

(d) If A is commutative, the quotient group G/G, contains no element of 
finite order (except for the identity). 


PROOF. (a) Theorem 10.11 shows that every x € G, is the center of an 
open ball U < G. Since U intersects G, and U is connected, U c G,. 
Therefore G, is open. 

If x € G, then x~'G, is a connected subset of G which contains 
x 'x = e. Hence x _'G, < G,, for every x € G,. This proves that G, is 
a subgroup of G. Also, y-'G,y is homeomorphic to G,, hence con- 
nected, for every y € G, and contains e. Thus y"'G,y < G,. By defini- 
tion, this says that G, is a normal subgroup of G. 


(b) Let I be the group generated by exp (A). For n = 1, 2, 3,..., 
let E, be the set of all products of n members of exp (A). Since 
y * € exp (A) whenever y € exp (A), I’ is the union of the sets E,. 
Since the product of any two connected sets is connected, induction 
shows that each E, is connected. Each E, contains e, and so E, < G,. 
Hence I is a subgroup of G,. 

Next, exp (A) has nonempty interior, relative to G (see Theorem 
10.30); hence so has [. Since Tis a group and since multiplication by 
any x € Gis a homeomorphism of G onto G, T 1s open. 

Each coset of I" in G, is therefore open, and so is any union of 
these cosets. Since I’ is the complement of a union of its cosets, I" is 
closed, relative to G,. 

Thus IT is an open and closed subset of G,. Since G, is con- 
nected, [ = G,. 
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(c) If A is commutative, the functional equation satisfied by exp 
shows that exp (A) is a group. Hence (b) implies (c). 
(d) We have to prove the following proposition: 


If A is commutative, if x € G, and if x" € G, for some positive integer n, 
then x € G,. 


Under these conditions, x” = exp (a) for some a € A, by (c). Put 
y =exp (n'a) and z= xy". Since y € G,, it suffices to prove that 
ze G,. 

The commutativity of A shows that 


n 


z" = x"y "= exp (a) exp (—a) =e. 


Therefore o(z) does not separate 0 from oo (it consists of at most n 
points, lying on the unit circle), and this implies, by Theorem 10.30, 
that z = exp (w) for some w e A. Put 


F(A) = exp (Aw). 


Then f: ¢ > G is continuous, f(0) = e € G,, hence f(€) < G,. In par- 
ticular, z = f(1) € G,. //// 


Theorem 12.38 will show that exp (A) is not always a group. 


Lomonosov’s Invariant Subspace Theorem 


An invariant subspace of an operator T € @(X) 1s, by definition, a closed 
subspace M of X such that M # {0}, M # X, and Tx € M forevery x e M; 
or, briefly, T(M) <c M. 

The question arises (and was asked more than half a century ago) 
whether it is true, for every complex Banach space X, that every T € BX) 
has an invariant subspace. In recent years some counterexamples have been 
constructed in some nonreflexive spaces, and even in 7’. (References are 
given in Appendix B.) Positive results have been found for certain classes of 
operators on a Hilbert space (in particular, for normal operators; see 
Chapter 12), but even there the general question is still open. 

Lomonosov’s proof of the following striking theorem used Schauder’s 
fixed point theorem to produce an eigenvalue (namely, 1). T. M. Hilden 
observed that this could also be done by an appeal to the spectral radius 
formula; the resulting proof is a slight simplification of the original one. 


10.35 Theorem Suppose that X is an infinite-dimensional complex 
Banach space and that T € @(X) is compact, T # 0. 
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Then there is a closed subspace M of X such that M # {0}, M# X, 
and 


(1) S(M)< M 


for every S € BX) that commutes with T. 


Observe, as a corollary, that every S « @(X) which commutes with 
some nonzero compact operator has an invariant subspace! 


PROOF. Let us introduce the notations 

(2) lr = {S € &(X): ST = TS} 
and, for each ye X, 

(3) ly) = {Sy: Se Th. 


It is easy to see that I is a closed subalgebra of @(X) and that I(y) is 
therefore a closed subspace of X which contains y. Thus I'(y) # {0} if 
y #0. Moreover, 


(4) ST (y)) < TV) 
for all ye X and S ET, simply because I is closed under multiplica- 
tion. Thus (1) holds for every I(y). 

If the conclusion of the theorem is false, it follows that I(y) = X 
for every y #0. 

Let us assume this. 

Pick x) € X so that Tx, # 0. Then x, 4 0, and the continuity of 
T shows that there is an open ball B, centered at x,, so small that 


(5) |Tx|| =2NTxoll and — |x|] > 2llxoll 


for every x © B. Our assumption about I(y) implies that every y #0 
has a neighborhood W which is mapped into the open set B by some 
S eI. Since T is a compact operator, K = T(B) is a compact set. By 
(5), 0¢ K. Therefore there are open sets W,, .... W,, whose union 
covers K, such that S(W,) < B for some S; ET, 1 <i<n. Put 


(6) p= max {Sillsse0s Sill}. 


Starting with x9, TX, lies in K, hence in some W,,, and S;, Tx € 
B. Therefore TS;,7 Xo lies in K, hence in some W,,, and S,;,TS;, TX 9 1s 
back in B. Continuing this ping-pong game, we obtain vectors 


(7) Xy =8,,T °°: S,,TXo9 =Si, °°: NOW baie on 
in B. Hence 


(8) 21IXoll < lxwll <HNIT™ | Ixoll = (N = 1, 2, 3, ..), 
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and this gives information about the spectral radius of T, namely, 


(9) A(T) = lim |TM* >t > 0, 
U 


No 


We now invoke Theorem 4.25. Since p(T) > 0, T has an eigen- 
value A # 0. The corresponding eigenspace 


(10) M,= {xe X: Tx = Ax} 
is finite-dimensional; hence M, # X. If Se T and x € M,, then 
(11) T(Sx) = S(T x) = S(Ax) = ASx, 


so that Sx e M,. This says that S(M,) < M,. 
Thus M, satisfies the conclusion of the theorem, even though we 
assumed that this conclusion failed. //// 


Exercises 


Throughout this set of exercises, A denotes a Banach algebra and x, y, ... denote 
elements of A, unless the contrary is stated. 


1. Use the identity (xy)" = x(yx)"~'y to prove that xy and yx always have the same 
spectral radius. 
2. (a) If x and xy are invertible in A, prove that y is invertible. 
(b) If xy and yx are invertible in A, prove that x and y are invertible. (The 
commutative case of this was tacitly used in the proofs of Theorems 10.13 
and 10.28.) 
(c) Show that it is possible to have xy = e but yx # e. For example, consider the 
right and left shifts S, and S,, defined on some Banach space of functions f 
on the nonnegative integers by 


(SrfXn)=f(n—1) ifnel, 
(Sz SO) = 0, 
(S, fn) =f(n + 1) for all n > 0. 


(d) If xy=e and yx =z e, show that z is a nontrivial idempotent. (This 
means that z? = z,z #0,z #e,) 

3. Prove that every finite-dimensional A is isomorphic to an algebra of matrices. 
Hint: The proof of Theorem 10.2 shows that every A is isomorphic to a sub- 
algebra of @(A). Conclude that xy = e implies yx = e if dim A < o. 

4. (a) Prove that e — yx is invertible in A if e — xy is invertible. Suggestion: Put 
z =(e— xy) ', write z as a geometric series (assume ||x|| < 1, ||y|| < 1), and 
use the identity stated in Exercise 1 to obtain a finite formula for (e — yx)~' 
in terms of x, y, z. Then show that this formula works without any 
restrictions on ||x|| or |ly||. 

(b) If AE €¢, A#0, and Ae o(xy), prove that A € o(yx). Thus o(xy) vu {0} = 
o(yx) U {0}. Show, however, that o(xy) is not always equal to o( yx). 
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10. 


11. 


12. 


13. 
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Let A, and A, be the algebras of all complex 2-by-2 matrices of the form 


cw wa 


Prove that every two-dimensional complex algebra A with unit e is isomorphic 
to one of these and that A, is not isomorphic to A,. Hint: Show that A has a 
basis {e, a} in which a? = Ae for some Ae ¢. Distinguish between the cases 
A=0, 4#0. Show that there exists a three-dimensional noncommutative 
Banach algebra. 


. Let A be the algebra of all complex functions f on {1, 2, 3, ...} which are 0 


except at finitely many points, with pointwise addition and multiplication and 
norm 


If l= Lk. 


Show that multiplication is left-continuous (hence also right-continuous, since A 
is commutative) but not jointly continuous. (Adjunction of a unit element, as 
suggested in Section 10.1, would make no difference.) Show, in fact, that there is 
a sequence { f,} in A so that || f,|| > 0 but || f2|| + 00, asn— oo. 


. Let C? = C7([0, 1]) be the space of all complex functions on [0, 1] whose 


second derivative is continuous. Choose a > 0, b > 0, and define 


Fl = WF lo + all f lls + OIF lle» 


Show that this makes C? into a Banach space, for every choice of a, b, but that 
the Banach algebra axioms hold if and only if 2b < a”. (For necessity, consider x 
and x7.) 


. What happens if the process of adjoining a unit (described in Section 10.1) is 


applied to an algebra A which already has a unit? Clearly, the result cannot be 
an algebra A, with two units. Explain. 


. Suppose that Q is open in ¢ and that f: Q- A and g: Q- ¢ are holomorphic. 


Prove that of : Q— A is holomorphic. [This was used in the proof of Theorem 
10.13, with g(A) = A”.] 
Another proof that o(x) 4 @ can be based on Liouville’s theorem 3.32 and the 
fact that (Ae — x)’ +0 as A-— oo. Complete the details. 
Call x € A a topological divisor of zero if there is a sequence {y,} in A, with 
lly, {| = 1, such that 
lim xy, = 0 = lim y, x. 

(a) Prove that every boundary point x of the set of all invertible elements of A is 

a topological divisor of zero. Hint: Take y, = x, */||x, '||, where x, > x. 
(b) In which Banach algebras is 0 the only topological divisor of 0? 
Find the spectrum of the operator T € @(¢7) given by 


T(X1, X2, X3, X45 ..) =(—x2, X15 = as X3, aise) 


Suppose K = {Ae ¢:1<|A| <2}. Put f(A) =A. Let A be the smallest closed 
subalgebra of C(K) that contains 1 and f Let B be the smallest closed sub- 
algebra of C(K) that contains f and 1/f, Describe the spectra a ,( f) and o,( f). 


14. 


15. 


16. 


17. 


18. 


19, 


20. 
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(a) Fubini’s theorem was applied to vector-valued integrals in the proof of 
Theorem 10.29. Justify this. 

(b) Construct a proof of Theorem 10.29 that uses no contour integrals, as 
follows: Prove the theorem first for polynomials g, then for rational func- 
tions g with no poles in Q,, and obtain the general case from Runge’s 
theorem. 

Suppose X is a Banach space, T € &@(X) is compact, and ||T"|| > 1 for all n > 1. 

Prove that the point spectrum of T is not empty. 

Let X = C([0, 1]) and define T € @(X) by 


s 


(TxXs) = | x(u) du (0<s <1). 
0 

Show that o,(T) = @. Hence f(a,(T)) = @ for every f, but if f = 0, then f(T) = 

0; hence 


o(f(T)) = 4,(0) = {0} 4 Z. 


The extra hypotheses in (c) and (d) of Theorem 10.33 are thus needed. 
Suppose that the spectrum of some x € A is not connected. Prove that A con- 
tains a nontrivial idempotent z. (This is defined in Exercise 2.) 

Prove also that A = Ay @ A,, where 


A, = {x:2zx=0}, A, ={x: zx =x}. 


Suppose Q is open in @, « is an isolated boundary point of Q, f:Q—-> xX isa 
holomorphic X-valued function (where X is some complex Banach space), and 
there is a smallest nonnegative integer n such that 


[A—aPI FAI 


is bounded as A > a. If n > 0, then fis said to have a pole of order n at a. 

(a) Suppose x € A and (Ae — x)~* has a pole at every point of o(x). [Note that 
this can happen only when o(x) is a finite set.] Prove that there is a non- 
trivial polynomial P such that P(x) = 0. 

(b) As a special case of (a), assume o(x) = {0} and (Ae — x)~* has a pole of order 
n at 0. Prove that x” = 0. 

Let Sp be the right shift, acting on 77, as in Exercise 2. Let {c,} be a sequence of 

complex numbers such that c, # 0 but c, > 0 as n> 00. Define M € &(¢7) by 


(MfXn)=c,f(n) (n> 0), 


and define T « &(¢*) by T = MS. 

(a) Compute ||T” ||, for m = 1, 2, 3,.... 

(b) Show that o(T) = {0}. 

(c) Show that T has no eigenvalue. (Its point spectrum is therefore empty, 
although its spectrum consists of a single point!) 

(d) Show that (AJ — T)~* does not have a pole at 0. 

(e) Show that T is a compact operator. 

Suppose x € A, x, € A, and lim x, = x. Suppose is an open set in ¢ that 

contains a component of o(x). Prove that o(x,) intersects Q for all sufficiently 
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21. 


22. 


23. 


24. 


25. 


26. 


large n. (This strengthens Theorem 10.20.) Hint: If o(x) <Q U OQ), where Q, is 

an open set disjoint from Q, consider the function f that is 1 in Q,01n Qo. 

Let Cp, be the algebra of all real continuous functions on [0, 1], with the 

supremum norm. This satisfies all requirements of a Banach algebra, except that 

the scalars are now real. 

(a) If O(f) = Jo f(t) at, then P(1) = 1, and ¢(f) # 0 if fis invertible in Cz, but 
is not multiplicative. 

(b) If G and G, are defined in Cp, as in Theorem 10.34, show that G/G, is a 
group of order 2. 

The analogues of Theorem 10.9 and (d) of Theorem 10.34 are thus false 
for real scalars. Exactly where would the proof of (d) of Theorem 10.34 break 
down? 

Suppose A = C(T), the algebra of all continuous complex functions on the unit 
circle T, with the supremum norm. Show that two invertible members of C(T) 
are in the same coset of G, if and only if they are homotopic mappings of T into 
the set of all nonzero complex numbers. Deduce from this that G/G, is iso- 
morphic to the additive group of the integers. (The notation is as in Theorem 
10.34.) 

Suppose A = M(R), the convolution algebra of all complex Borel measures on 
the real line; see (e) of Example 10.3. Supply the details in the following proof 
that G/G, is uncountable: If a € R, let 6, be the unit mass concentrated at a. 
Assume 6,¢€G,. Then 6,=exp(u,) for some yp, e€ M(R); hence, for 
—-0o<t<0o, 


—iat = p(t) + 2kzi, 


where k is an integer. Since ji, is a bounded function, « = 0. Thus 6, is the only 

6, 1n G,. No coset of G, in G contains therefore more than one 6, . 

(a) Prove that A is commutative if there is a constant M < oo such that 
l|xy|| < M|lyx|| for all x and yin A. Hint: ||w~*yw|| < M|lyll if w is invertible 
in A. Replace w by exp (Ax), where xe A and Ae. Continue as in 
Theorem 12.16. 

(b) Prove that A is commutative if ||x?|| = ||x||? for every x € A. Hint: Show 

that ||x|| = p(x). Use Exercise 1 to deduce that ||w~ ‘yw]| = ||y||. Continue as in 

(a). 

As regards the invariant subspace problem, described in the introduction to 

Theorem 10.35, explain why the problem is 

(a) trivial in ¢", 

(b) different in R", 

(c) uninteresting if X is not separable. 

How should Lomonosov’s theorem be reformulated when X = ¢"? 

Let S, be the right shift on 77, as in Exercise 2. Prove that 0 is the only compact 

T € &¢7) that commutes with Sp. Hint: If T 4 0 then 


| T(Szx) — T(SRx)l| 


does not tend to O when N — M > oo. 


CHAPTER 


COMMUTATIVE 
BANACH 
ALGEBRAS 


This chapter deals primarily with the Gelfand theory of commutative 
Banach algebras, although some of the results of this theory will be applied 
to noncommutative situations. The terminology of the preceding chapter 
will be used without change. In particular, Banach algebras will not be 
assumed to be commutative unless this is explicitly stated, but the presence 
of a unit will be assumed without special mention, as will the fact that the 
scalar field is ¢. 


Ideals and Homomorphisms 


11.1 Definition A subset J of a commutative complex algebra A is said 
to be an ideal if 


(a) J isasubspace of A (in the vector space sense), and 


(b) xy e J whenever x € Aand ye J. 


If J # A, J is a proper ideal. Maximal ideals are proper ideals which 
are not contained in any larger proper ideal. 
11.2 Proposition 


(a) No proper ideal of A contains any invertible element of A. 
(b) If J is an ideal in a commutative Banach algebra A, then its closure J is 
also an ideal. 


The proofs are so simple that they are left as an exercise. 
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11.3. Theorem 


(a) If A is a commutative complex algebra with unit, then every proper ideal 
of A is contained in a maximal ideal of A. 


(b) If A is a commutative Banach algebra, then every maximal idea of A is 
closed. 


PROOF. (a) Let J be a proper ideal of A. Let P be the collection of all 
proper ideals of A that contain J. Partially order Y by set inclusion, 
let 2 be a maximal totally ordered subcollection of F (the existence of 
2 is assured by Hausdorff’s maximality theorem), and let M be the 
union of all members of 2. Being the union of a totally ordered collec- 
tion of ideals, M is an ideal. Obviously J c M, and M # A since no 
member of Y contains the unit of A. The maximality of 2 implies that 
M is a maximal ideal of A. 

(b) Suppose M is a maximal ideal in A. Since M contains no 
invertible element of A and since the set of all invertible elements is 
open, M contains no invertible element either. Thus M is a proper 
ideal of A, and the maximality of M shows therefore that M = M. //// 


11.4 Homomorphisms and quotient algebras If A and B are commu- 
tative Banach algebras and ¢ is a homomorphism of A into B (see Section 
10.4) then the null space or kernel of ¢ is obviously an ideal in A, which is 
closed if ¢@ is continuous. 

Conversely, suppose J is a proper closed ideal in A and 2: A > A/J is 
the quotient map, as in Definition 1.40. Then A/J is a Banach space, with 
respect to the quotient norm (Theorem 1.41). We will show that A/J is 
actually a Banach algebra and that x is a homomorphism. 

If x’ —x e J and y’ — ye J, the identity 


(1) x'y — xy =(x' — x)y’ + x(y' — y) 


shows that x’y’ — xy e J; hence n(x’y’) = a(xy). Multiplication can therefore 
be unambiguously defined in A/J by 


(2) mx)n(y) = (xy) (xe A, ye A). 


It is then easily verified that A/J is a complex algebra and that z is a 
homomorphism. Since ||2(x)|| < ||x||, by the definition of the quotient norm, 
7 is continuous. 

Suppose x; € A (i = 1, 2) and 6 > 0. Then 


(3) Ix; + yill S Ia) +0 (= 1, 2) 


for some y, € J, by the definition of the quotient norm. Since 
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(x, + yil(X2 + Y2) € Xx, + J, 


we have 

(4) |m(XX2)I] < [1 + YM %2 + Yodll S UX + yrllllx2 + Yall, 
so that (3) implies the multiplicative inequality 

(5) | (x4 )a(x2)l] < IIx Il ey) I]. 


Finally, if e is the unit element of A, then (2) shows that x(e) is the 
unit of A/J, and since x(e) #0, (5) shows that ||z(e)|| > 1 = |lel|. Since 
|| z(x)|| < ||x|| for every x € A, ||z(e)|| = 1. This completes the proof. 

Part (a) of the next theorem is one of the key facts of the whole theory. 
The set A that appears in it will later be given a compact Hausdorff topol- 
ogy (Theorem 11.9). The study of commutative Banach algebras will then to 
a large extent be reduced to the study of more familiar (and more special) 
objects, namely, algebras of continuous complex functions on A, with point- 
wise addition and multiplication. However, Theorem 11.5 has interesting 
concrete consequences even without the introduction of this topology. Sec- 
tions 11.6 and 11.7 illustrate this point. 


11.55. Theorem Let A be a commutative Banach algebra, and let A be the 
set of all complex homomorphisms of A. 


(a) Every maximal ideal of A is the kernel of some h € A. 
(b) IfheA, the kernel of h is a maximal ideal of A. 
(c) An element x € A is invertible in A if and only if h(x) 4 O for every he A. 


(d) An element x € A is invertible in A if and only if x lies in no proper ideal 
of A. 
(e) A € o(x) if and only if h(x) = A for some he A. 


PROOF. (a) Let M be a maximal ideal of A. Then M is closed 
(Theorem 11.3) and A/M is therefore a Banach algebra. Choose x e 4, 
x ¢ M, and put 


(1) J ={ax+y:aeA,yeM}. 


Then J is an ideal in A which is larger than M, since x € J. (Take 
a=e, y=0.) Thus J = A, and ax + y =e for some ae A, ye M. If 
mt: A — A/M is the quotient map, it follows that 2(a)n(x) = n(e). Every 
nonzero element z(x) of the Banach algebra A/M is therefore invert- 
ible in A/M. By the Gelfand-Mazur theorem, there is an isomorphism 
j of A/M onto ¢. Put h =j o x. Then he A, and M is the null space 
of h. 


278 PART III: BANACH ALGEBRAS AND SPECTRAL THEORY 


(b) If he A, then h~*(0) is an ideal in A which is maximal 
because it has codimension 1. 
(c) If x is invertible in A and h ¢€ A, then 


h(x)h(x~*') = h(xx~*) = h(e) = 1, 


so that h(x) # 0. If x is not invertible, then the set {ax: a € A} does 
not contain e, hence is a proper ideal which lies in a maximal one 
(Theorem 11.3) and which is therefore annihilated by some he A, 
because of (a). 

(d) No invertible element lies in any proper ideal. The converse 
was proved in the proof of (c). 


(e) Apply (c) to Ae — x in place of x. //// 


Our first application concerns functions on R" that are sums of absol- 
utely convergent trigonometric series. The notation is as in Exercise 22 of 
Chapter 7. 


11.6 Wiener’s lemma Suppose f is a function on R", and 


(1) f(x) = Vane *, J |ag| < 0, 


where both sums are extended over all me Z". If f(x) #0 for every x € R", 
then 


it su 
(2) rr b,, @ with > |b,,| < ©. 


PROOF. Let A be the set of functions of the form (1), normed by 
lf \| => |a,,|. One checks easily that A is a commutative Banach 
algebra, with respect to pointwise multiplication. Its unit is the con- 
stant function 1. For each x, the evaluation f— f(x) 1s a complex 
homomorphism of A. The assumption about the given function f is 
that no evaluation annihilates it. If we can prove that A has no other 
complex homomorphisms, (c) of Theorem 11.5 will imply that f is 
invertible in A, which is exactly the desired conclusion. 

For r= 1,..., n, put g,(x) = exp (ix,), where x, is the rth coordi- 
nate of x. Then g, and 1/g, are in A and have norm 1. If he A, it 

h(g,) 


follows from (c) of Theorem 10.7 that 
1 
“4G 
Dy 
Hence there are real numbers y, such that 


(3) hg,) = exp (iy,)=9,y) (Usrsn), 


t 


<i. 


|h(g,)| < 1 and 
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where y =(j,,..., y,). If P is a trigonometric polynomial (which 
means, by definition, that P is a finite linear combination of products 
of integral powers of the functions g, and 1/g,), then (3) implies 


(4) h(P) = P(y), 

because h is linear and multiplicative. Since h is continuous on A 
(Theorem 10.7) and since the set of all trigonometric polynomials is 
dense in A (as is obvious from the definition of the norm), (4) implies 
that h(f) = f(y) for every fe A. Thus h is evaluation at y, and the 
proof is complete. iii] 


This lemma was used (with n = 1) in the original proof of the tau- 
berian theorem 9.7. To see the connection, let us reinterpret the lemma. 
Regard Z" as being embedded in R" in the obvious way. The given coeffi- 
cients a,, define then a measure p on R", concentrated on Z”, which assigns 
mass a,, to each me Z". Consider the problem of finding a complex 
measure o, concentrated on Z", such that the convolution p * o is the Dirac 
measure 6. Wiener’s lemma states that this problem can be solved if (and 
trivially only if) the Fourier transform of yu has no zero on R"; this is pre- 
cisely the tauberian hypothesis in Theorem 9.7. 

For our next application, let U" be the set of all points z = (z,, ..., z,,) 
in ¢" such that |z;| < 1 for 1 <i<n. In other words, this polydisc U" is the 
cartesian product of n copies of the open unit disc U in ¢. We define A(U") 
to be the set of all functions f that are holomorphic in U" (see Definition 
7.20) and that are continuous on its closure U”. 


11.7 Theorem Suppose f,, ..., f, €¢ A(U"), and suppose that to each 
z € U" there corresponds at least one i such that f{z) #0. Then there exist 
functions $1, ..., 6, € A(U") such that 


(1) fiZo(2) +--+ fl2oz)=1 (ze U"). 


PROOF. A = A(U") is a commutative Banach algebra, with pointwise 
multiplication and the supremum norm. Let J be the set of all sums 
>: f,0;, with @; € A. Then J is an ideal. If the conclusion is false, then 
J # A; hence J lies in some maximal ideal of A (Theorem 11.3), and 
some h € A annihilates J, by (a) of Theorem 11.5. 

For 1 <r<n, put g,(z) =z,. Then |/g,|| = 1; hence h(g,) = w,, 
with |w,| < 1. Put w = (w,,..., w,). Then w e U", and h(g,) = g,(w). It 
follows that h(P) = P(w) for every polynomial P, since h is a homo- 
morphism. The polynomials are dense in A(U") (Exercise 4). Hence 
h(f) =f (w) for every f € A, by essentially the same argument that was 
used in the proof of Theorem 11.6. 

Since h annihilates J, f(w) = 0 for 1 <i < k. This contradicts the 


hypothesis. [II 
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Gelfand Transforms 


11.8 Definitions Let A be the set of all complex homomorphisms of a 
commutative Banach algebra A. The formula 


(1) X(h) = h(x) (he A) 


assigns to each x € A a function x: A— ¢; we call X the Gelfand transform 
of x. 

Let A be the set of all %, for x € A. The Gelfand topology of A is the 
weak topology induced by A, that is, the weakest topology that makes 
every X continuous. Then obviously A < C(A), the algebra of all complex 
continuous functions on A. 

Since there is a one-to-one correspondence between the maximal 
ideals of A and the members of A (Theorem 11.5), A, equipped with its 
Gelfand topology, is usually called the maximal ideal space of A. 

The term “Gelfand transform” is also applied to the mapping x — xX of 
A onto A. 

The radical of A, denoted by rad 4, is the intersection of all maximal 
ideals of A. If rad A = {0}, A is called semisimple. 


11.9 Theorem Let A be the maximal ideal space of a commutative 
Banach algebra A. 


(a) Ais acompact Hausdorff space. 

(b) The Gelfand transform is a homomorphism of A onto a subalgebra A of 
C(A), whose kernel is rad A. The Gelfand transform is therefore on iso- 
morphism if and only if A is semisimple. 

(c) For each x é€ A, the range of X is the spectrum o(x). Hence 

IXIlo = p(x) < IIx, 
where ||X||,, is the maximum of | X(h)| on A, and x € rad A if and only if 
p(x) = 9. 


PROOF. We first prove (b) and (c). Suppose x € A, ye A,ae GC, head. 
Then 


(ax)"*(h) = h(ax) = ah(x) = (ax)h), 
(x + y)*(h) = h(x + y) = A(x) + hy) = XA) + HA) = (& + 5A), 
and 
(xy)*(h) = h(xy) = h(x)h(y) = X(h) Wh) = (XY). 


Thus x > X is a homomorphism. Its kernel consists of those x € A 
which satisfy h(x) = 0 for every h e A; by Theorem 11.5, this is the 
intersection of all maximal ideals of A, that is, rad A. 
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To say that A is in the range of x means that A = x(h) = h(x) for 
some heA. By (e) of Theorem 11.5, this happens if and only if 
A € a(x). This proves (b) and (c). 

To prove (a), let A* be the dual space of A (regarded as a 
Banach space), and let K be the norm-closed unit ball of A*. By the 
Banach-Alaoglu theorem, K is weak*-compact. By (c) of Theorem 
10.7, Ac K. The Gelfand topology of A is evidently the restriction to 
A of the weak*-topology of A*. It is therefore enough to show that A 
is a weak*-closed subset of A*. 

Let A, be in the weak*-closure of A. We have to show that 


(1) Ao(xy)= AgpxAgy = (x€ A, ye A) 
and 
(2) Ao e = 1. 


[Note that (2) is necessary; otherwise A, would be the zero homo- 
morphism, which is not in A. ] 
Fix xe A,ye A,e>O. Put 


(3) W = {Ae A*: | Az; — Agz;| < ¢ for 1 <i < 4}, 


where z,; =e, 2,=X, 23=y, 24=xy. Then W is a_ weak*- 
neighborhood of A, which therefore contains an h ¢€ A. For this h, 


(4) |1 — Age| =| h(e) — Apel <6, 
which gives (2), and 
No{xy) — Ao XAg y = [Ao(xy) — h(xy)] + La@h(y) — Ao xAo YI 
= [Ao(xy) — A(xy)] + Thy) — Ao yJh(x) 
+ [h(x) — Ao x]Ao J; 
which gives 
(5) | Ao(xy) — Ag XAg y| < (1 + [xl] + | Ao y De. 
Since (5) implies (1), the proof is complete. HTH 


Semisimple algebras have an important property which was earlier 
proved for ¢: 


11.10 Theorem If Ww: A-—B is a homomorphism of a Banach algebra A 
into a semisimple commutative Banach algebra B, then W is continuous. 


PROOF. Suppose x, > x in A and W(x,)— y in B. By the closed graph 
theorem, it is enough to show that y = (x). 
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Pick some homomorphism h: B>~€. Then g=hcow is a 
homomorphism of A into €. Theorem 10.7 shows that h and @ are 
continuous. Hence 


h(y) = lim A(p(x,)) = lim g(x,) = g(x) = h(y(x)), 
for every h € Ay. Thus y — (x) € rad B. Since rad B = {0}, y = W(x). 
LL 


Corollary. Every isomorphism between two semisimple commutative 
Banach algebras is a homeomorphism. 


In particular, this is true of every automorphism of a semisimple 
commutative Banach algebra. The topology of such an algebra is therefore 
completely determined by its algebraic structure. 

In Theorem 11.9, the algebra A may or may not be closed in C(A), 
with respect to the supremum norm. Which of these cases occurs can 
be decided by comparing ||x?|| with ||x||?, for all x eA. Recall that 
|x || < ||x||? is always true. 


11.11. Lemma _ [fA is a commutative Banach algebra and 


ee 
(1) inf = int > s = inf ne “1 (x € A, x £0), 
then s?<r<s. 

PROOF. Since ||X||,, > sl|xll, 

(2) x7] > 1X7, = WEN = 87 Ill]? 


for every x € A. Thus s? <r. 
Since ||x?|| > r||x||? for every x € A, induction on n shows that 


(3) x™|| =r" Ix™ (m= 2", n= 1, 2, 3,..)). 


Take mth roots in (3) and let m— oo. By the spectral radius formula 
and (c) of Theorem 11.9, 


(4) IXllo = p(x) 2 r|/xll (x € A). 
Hence r < s. H/1 


11.12 Theorem Suppose A is a commutative Banach algebra. 


(a) The Gelfand transform is an isometry (that is, ||x|| = ||X||,, for every 
x € A) if and only if ||x?\| = ||x||? for every x € A. 

(b) A is semisimple and A is closed in C(A) if and only if there exists K < 0 
such that ||x||* < K||x?|| for every x € A. 
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PROOF. (a) In the terminology of Lemma 11.11, the Gelfand transform 
is an isometry if and only if s = 1, which happens (by the lemma) if 
and only ifr = 1. 

(b) The existence of K is equivalent to r > 0, hence to s > 0, by 
the lemma. If s > 0, then x > X is one-to-one and has a continuous 
inverse, so that A is complete (hence closed) in C(A). Conversely, if 
x +X is one-to-one and if A is closed in C(A), the open mapping 
theorem implies that s > 0. //// 


11.13. Examples In some cases, the maximal ideal space of a given 
commutative Banach algebra can easily be described explicitly. In others, 
extreme pathologies occur. We shall now give some examples to illustrate 
this. 


(a) Let X be a compact Hausdorff space, put A = C(X), with the 
supremum norm. For each x e X, f— f(x) is a complex homomorphism h,. 
Since C(X) separates points on X (Urysohn’s lemma), x # y implies 
h, # h,. Thus x > h, embeds X in A. 

We claim that each he A is an h,. If this is false, there is a maximal 
ideal M in C(X) which contains, for each p € X, a function f with f(p) # 0. 
The compactness of X implies then that M contains finitely many functions 
fi, .--,f, such that at least one of them is #0 at each point of X. Put 


g hid: ae +fuaby- 


Then g € M, since M is an ideal; g > 0 at every point of X; hence g is 
invertible in C(X). But proper ideals contain no invertible elements. 

Thus x<«+h, is one-to-one correspondence between X and A and can 
be used to identify A with X. This identification is also correct in terms of 
the two topologies that are involved: The Gelfand topology y of X is the 
weak topology induced by C(X) and is therefore weaker than 1, the original 
one, but y is a Hausdorff topology; hence y = t. [See (a) of Section 3.8. | 

Summing up, X “is” the maximal ideal space of C(X), and the 
Gelfand transform is the identity mapping on C(X). 


(b) Let A be the algebra of all absolutely convergent trigonometric 
series, as in Section 11.6. We found there that the complex homomorphisms 
are the evaluations at points of R”. Since the members of A are 27-periodic 
in each variable, A is the torus T” obtained from R” by the mapping 
ix, ixn 


(X15 --+5 X,) > (e!, ...,e 


This is an example in which A is dense in C(A), although A 4 C(A). 


(c) In the same way, the proof of Theorem 11.7 contains the result 
that U" is the maximal ideal space of A(U"). The argument used at the end 
of (a) shows that the natural topology of U" is the same as the Gelfand 
topology induced by A(U"); the same remark applies to (b). 
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(d) The preceding example has interesting generalizations. Let A now 
be a commutative Banach algebra with a finite set of generators, say X,, ..., 
x,- Lhis means that x; € A (1 <i <n) and that the set of all polynomials in 
X1,.-+-, X, 18 dense in A. Define 


(1) P(h) = (X,(h), ..., Xa(h)) (he A). 


Then @ is a homeomorphism of A onto a compact set K < €". Indeed, @ is 
continuous since A c C(A). If (h,) = o(h,), then h,(x,) = h,(x,) for all i; 
hence h,(x) = h(x) whenever x is a polynomial in x,, ..., x,, and since 
these polynomials are dense in A, h, = h,. Thus ¢ is one-to-one. 

We can now transfer A from A to K and may thus regard K as the 
maximal ideal space of A. To make this precise, define 


(2) Wx)=Xop i (x € A). 


Then wy is a homomorphism (an isomorphism if A is semisimple) of A onto 
a subalgebra (A) of C(K). One verifies easily that 


(3) W(x)\(z)=2; ifz=(2,,..., 2,) 6K, 
and therefore 
(4) W(P(X1,---, X,)(Z) = P(z) (ze K) 


for every polynomial P in n variables. 

It follows that every member of (A) is a uniform limit of polynomials, 
on K. 

The sets K < ¢" which arise in this fashion as maximal ideal spaces 
have a property known as polynomial convexity: 


If we ¢" and w ¢€ K, there exists a polynomial P such that | P(z)| < 1 
for every z € K, but | P(w)| > 1. 


To prove this, assume there is no such polynomial. The norm- 
decreasing property of the Gelfand transform implies then that 


(5) | P(w)| < |] P(1; ---5 Xa)ll 


for every polynomial P; the norm is that of A. Since {x,, ..., x,} is a set of 
generators of A, it follows from (5) that there is an h € A such that ¢(h) = w. 
But then w e€ K, and we have a contradiction. 

The compact polynomially convex subsets of ¢ are simply those 
whose complement is connected; this is an easy consequence of Runge’s 
theorem. In ¢”, the structure of the polynomially convex sets is by no 
means fully understood. 


(e) Our next example shows that the Gelfand transform is a gener- 
alization of the Fourier transform, at least in the L’-context. 
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Let A be L’(R") with a unit attached, as described in (d) of Section 
10.3. The members of A are of the form f + «6, where f € L'(R"), a € ¢, and 
6 is the Dirac measure on R"”; multiplication in A is convolution: 


(f + ad) * (g + Bo) =(f * 9g + Bf + ag) + aBo. 


For each t € R", the formula 


(6) h(f + 06) =f(t) + « 


defines a complex homomorphism of A; here fis the Fourier transform of f. 
In addition, 


(7) h,(f + «6) = a 


also defines a complex homomorphism. There are no others. (A proof will 
be sketched presently.) Thus A, as a set, is R"” U {co}. Give A the topology 
of the one-point compactification of R". Since f(t) > 0 as |t| —> 00, for every 
f € L(R"), it follows from (6) and (7) that 4 < C(A). Since A separates points 
on A, the weak topology induced on A by A is the same as the one that we 
just chose. 

It remains to be proved that every he A is of the form (6) or (7). If 
h(f)=0 for every fe L'(R"), then h=h,,. Assume h(f) 40 for some 
feL(R"). Then h(f)=J fBdm,, for some fBeL*(R"). Since 
h(f * g) = h(f)h(g), one can prove that B coincides almost everywhere with 
a continuous function b which satisfies 


(8) B(x + y) = B(x)b(y) (x, y € R”). 
Finally, every bounded solution of (8) is of the form 
(9) b(x) =e"! (x € R") 


A 


for some t € R". Thus h(f) = f(t), and h has the form (6). 
For n= 1, the details that complete the preceding sketch may be 
found in Sec. 9.22 of [23]. The case n > 1 is quite similar. 


(f) Our final example is L*(m). Here m is Lebesgue measure on the 
unit interval [0,1], and L°(m) is the usual Banach space of equivalence 
classes (modulo sets of measure 0) of complex bounded measurable func- 
tions on [0, 1], normed by the essential supremum. Under pointwise multi- 
plication, this is obviously a commutative Banach algebra. 

If fe L°(m) and G, is the union of all open sets Gc with 
m(f ~'(G)) = 0, then the complement of G, (called the essential range of f) is 
easily seen to coincide with the spectrum oa(/) of f, hence with the range of 
its Gelfand transform f. It follows that f is real if f is real. Hence L°(m)‘ is 
closed under complex conjugation. By the Stone-Weierstrass theorem, 
L*(m)* is therefore dense in C(A), where A is the maximal ideal space of 
L°(m). It also follows that ff is an isometry, so that L°(m)‘ is closed in 
C(A). 
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We conclude that f > f is an isometry of L°(m) onto C(A). 

Next, f + | f dm is a bounded linear functional on C(A). By the Riesz 
representation theorem, there is therefore a regular Borel probability 
measure p on A that satisfies 


(10) | fau= [sam 
A 0 


for every fe L*(m). This measure is related to the topology of A in the 
following way: 


(i) pV) > Oif V is open and nonempty. 
(ii) To every bounded Borel function g on A corresponds an f € C(A) such 
that f= 9 ae. [p]. 
(iii) If V is open, so is V. 
(iv) If E isa Borel set in A, then 


(11) u(E°) = p(E) = p(E). 


If V is as in (i), Urysohn’s lemma implies that there is an f € C(A), 
f> 0, such that f = 0 outside V and f(p) = 1 at some p € V. Hence f is not 
the zero element of L°(m), and the integrals (10) are positive. This gives (i). 

In (ii), assume | g| < 1. Since C(A) is dense in L7(y) (recall that p is a 
regular Borel probability measure), there are functions f, € C(A) such that 
) lf, — o|? du-0, and which can be so adjusted that | f,.| <1. Then 
ll falloo < 1, and (10) implies that {f,} is a Cauchy sequence in I?(m). Hence 
there is an f € L°(m) such that 


(12) [n-7F in= | ‘f-fim 
A (0) 


asn— oo. Thus g = fae. [py]. 

Next, let V be open and let W be the complement of V. By (ii) there is 
an f € C(A) such that f = 1 ae. [uv] off V. The set on which f is neither 1 nor 
0 is open and has p-measure 0, hence is empty, by (i). The same reasoning 
shows that the sets V 0 {f#1} and W 7 {f #0} are empty. Hence f= 1 
on Vf =0on W. 

This proves (iii), and shows also that ph(V)= L(V). Taking com- 
plements, we see that u(K°) = p(K) for every compact K < A. 

If E is a Borel set in A, and ¢ > 0, then there is a compact K and an 
open V such that K c Ec V and p(V) < p(K) + «. Hence 


u(E) < u(V) = WV) < w(K) + ¢ = p(K°) + € < u(E’) +, 


and this proves (iv). 
It is an easy consequence of (iii) that disjoint open sets have disjoint 
closures. 
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If we define two bounded Borel functions g and w to be equivalent, 
provided p{g ¥ w} = 0, then (ii) asserts that every equivalence class con- 
tains one continuous function—and only one, by (i). Hence (with an 
obvious interpretation) L*(u) = C(A). 

Property (iv) asserts, among other things, that of two disjoint Borel 
sets in A, at most one can be dense in A, even though no point of A is 
isolated (Exercise 18). 

We conclude with an application to measure theory. If E and F are 
measurable sets, let us say that F almost contains E if F contains E except 
for a set of measure 0, that is, if m(E — F) = 0. 

The union of an uncountable collection of measurable sets is not 
always measurable. However, the following is true: 


If {E,} is an arbitrary collection of measurable sets in (0, 1], there is a 
measurable set E < [0, 1] with the following two properties: 


(i) E almost contains every E,. 


(ii) If F is measurable and F almost contains every E,, then F almost 
contains E. 


Thus E is the least upper bound of {E,}. The existence of E implies 
that the Boolean algebra of measurable sets (modulo sets of measure 0) is 
complete. 

With the machinery now at our disposal, the proof is very simple. 

Let f, be the characteristic function of E,. Its Gelfand transform f, is 
then the characteristic function of an open (and closed) set Q, < A. Let Q be 
the union of all these Q,. Then Q is open, so is its closure Q, and there 
exists f € [°(m) such that f is the characteristic function of O. The desired 
set E is the set of all x € (0, 1] at which f(x) = 1. 


Involutions 


11.14 Definition A mapping x—x* of a complex (not necessarily 
commutative) algebra A into A is called an involution on A if it has the 
following four properties, for allx € A,yeé A,andAe C: 


(1) (x + y)* = x* + y*. 
(2) (Ax)* = Ax*. 

(3) (xy)* = y*x*. 
(4) Dae 


In other words, an involution is a conjugate-linear antiautomorphism 
of period 2. 
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Any x € A for which x* = x is called hermitian, or self-adjoint. 

For example, ff is an involution on C(X). The one that we will be 
most concerned with later is the passage from an operator on a Hilbert 
space to its adjoint. 


11.15 Theorem [fA is a Banach algebra with an involution, and if x € A, 
then 


(a) x+x*, i(x — x*), and xx* are hermitian, 

(b) x has a unique representation x = u + iv, with ue A, v € A, and both u 
and v hermitian, 

(c) the unit e is hermitian, 

(d) x is invertible in A if and only if x* is invertible, in which case 
(x*)~! = (x7 4)*, and 

(e) Ae a(x) if and only if 4 € o(x*). 


PROOF. Statement (a) is obvious. If 2u = x + x*, 2v = i(x* — x), then 


x =u + iv 1s a representation as in (b). Suppose x = uw’ + iv’ is another 
one. Put w = v’ — v. Then both w and iw are hermitian, so that 


iw = (iw)* = —iw* = —iw. 


Hence w = 0, and the uniqueness follows. 
Since e* = ee*, (a) implies (c); (d) follows from (c) and (xy)* = 
y*x*. Finally, (e) follows if (d) is applied to Ae — xin place of x. —//// 


11.16 Theorem [If the Banach algebra A is commutative and semisimple, 
then every involution on A is continuous. 


PROOF. Let h be a complex homomorphism of A, and define (x) = 
h(x*). Properties (1) to (3) of Definition 11.14 show that @ is a complex 
homomorphism. Hence @ is continuous. Suppose x, > x and x* > y 
in A. Then 


h(x*) = $(x) = lim (x,) = lim h(t) = Aly). 


Since A is semisimple, y = x*. Hence x > x* is continuous, by the 
closed graph theorem. //// 


11.17 Definition A Banach algebra A with an involution x > x* that 
satisfies 


(1) |xx*|] = ||xII? 


for every x € A is called a B*-algebra. 
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Note that ||x||? = ||xx*|| < |||] |lx*|]| implies ||x|| < ||x*||, hence also 
|x* || < ]x**]| = IIx]. 
Thus 
(2) |x* |] = |x| 
in every B*-algebra. It also follows that 
(3) l|xx* || = |||] ]<* I. 


Conversely, (2) and (3) obviously imply (1). 
The following theorem is the key to the proof of the spectral theorem 
that will be given in Chapter 12. 


11.18 Theorem (Gelfand-Naimark) Suppose A is a commutative B*- 
algebra, with maximal ideal space A. The Gelfand transform is then an isomet- 
ric isomorphism of A onto C(A), which has the additional property that 


(1) h(x*) = h(x) (xe A, he A), 


or, equivalently, that 


(2) (xt) =x (x € A). 

In particular, x is hermitian if and only if Xx is a real-valued function. 

The interpretation of (2) is that the Gelfand transform carries the 
given involution on A to the natural involution on C(A), which is conjuga- 


tion. Isomorphisms that preserve involutions in this manner are often called 
*-isomorphisms. 


PROOF. Assume first that u € A, u = u*, h e A. We have to prove that 
h(u) is real. Put z = u + ite, for real t. If h(u) = a + if, with « and B 
real, then 


h(z) =a+ i(B + 0), zz* = u* + te, 
so that 
a”? + (B + t)? =|h(z)|? < |[2z||* = |lzz*I| < lull? + ¢?, 
Or 
(3) a? + B2+2Bt< lull? (-wo<t<o). 


By (3), B = 0; hence h(u) is real. 

If xe A, then x =u+iv, with u=u*, v =v*. Hence x* = 
u — iv. Since i and 0 are real, (2) is proved. 

Thus A is closed under complex conjugation. By the Stone- 
Weierstrass theorem, A is therefore dense in C(A). 
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If xe A and y=xx*, then y=y* so that |/y?|| = |lyl|?. It 
follows, by induction on n, that ||y”|| = |ly||" for m= 2". Hence 
IS. = llyll, by the spectral radius formula and (c) of Theorem 11.9. 
Since y = xx*, (2) implies that f = | X|?. Hence 


WX = Wo = Wyll = [xxl] = [xl], 


or ||X||,, = ||x||. Thus x > & is an isometry. Hence A is closed in C(A). 
Since A is also dense in C(A), we conclude that A = C(A). This com- 
pletes the proof. ny 


The next theorem is a special case of the one just proved. We shall 
state it in a form that involves the inverse of the Gelfand transform, in order 
to make contact with the symbolic calculus. 


11.19 Theorem If A is a commutative B*-algebra which contains an 
element x such that the polynomials in x and x* are dense in A, then the 
formula 


(1) (Hf) =feo x 
defines an isometric isomorphism ¥ of C(a(x)) onto A which satisfies 
(2) Vf = (Pf)* 


for every f € C(a(x)). Moreover, if f(A) = 4 on a(x), then Pf = x. 


PROOF. Let A be the maximal ideal space of A. Then x is a continuous 
function on A whose range is a(x). Suppose h, € A, h, € A, and 
X(h,) = X(h,), that is, h,(x) = h,(x). Theorem 11.18 implies then that 
h,(x*) = h,(x*). If P is any polynomial in two variables, it follows that 


hy(P(x, x*)) = h2(P(x, x*)), 


since h, and h, are homomorphisms. By hypothesis, elements of the 
form P(x, x*) are dense in A. The continuity of h, and h, implies 
therefore that h,(y) = h,(y) for every y € A. Hence h, =h,. We have 
proved that X is one-to-one. Since A is compact, it follows that x is a 
homeomorphism of A onto a(x). 

The mapping f— f° xX is therefore an isometric isomorphism of 
C(a(x)) onto C(A) which also preserves complex conjugation. 

Each f  X is thus (by Theorem 11.18) the Gelfand transform of a 
unique element of A which we denote by ‘Vf and which satisfies 
IF || = I fll... Assertion (2) comes from (2) of Theorem 11.18. If 
f(A) = A, then fo X = X, so that (1) gives Pf = x. iy 


Remark. In the situation described by Theorem 11.19, it makes per- 
fectly good sense to write f(x) for the element of A whose Gelfand 
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transform is f° x. This notation 1s indeed frequently used. It extends 
the symbolic calculus (for these particular algebras) to arbitrary con- 
tinuous functions on the spectrum of x, whether they are holomorphic 
or not. 


The existence of square roots is often of special interest, and in alge- 
bras with involution one may ask under what conditions hermitian ele- 
ments have hermitian square roots. 


11.20 Theorem Suppose A is a commutative Banach algebra with an 
involution, x € A, x = x*, and o(x) contains no real A with A < 0. Then there 
exists y € A with y = y* and y* = x. 


Note that the given involution is not assumed to be continuous. This 
will give us an opportunity to use the radical of A. We shall see later, in 
Theorem 11.26, that commutativity can be dropped from the hypotheses. 
This will be used in the proof of Theorem 11.31. 


PROOF. Let Q be the complement (in @) of the set of all nonpositive 
real numbers. There exists f € H(Q) such that f?(A) = A, and f(1) = 1. 
Since a(x) < Q, we can define y € A by 


(1) y =f (x), 
as in Definition 10.26. Then y? = x, by Theorem 10.27. We will prove 
that y* = y. 


Since Q is simply connected, Runge’s theorem furnishes poly- 
nomials P, that converge to f, uniformly on compact subsets of Q. 
Define Q,, by 


(2) 20,(4) = P,(A) + P,(A). 


Since f(A) = f(A), the polynomials Q, converge to f in the same 
manner. Define 


(3) yn = Q,(x) (n = 1, 2, 3, ...). 


By (2), the polynomials Q, have real coefficients. Since x = x*, it 
follows that y, = y*. By Theorem 10.27, 

(4) y = limy,, 

since Q, > f, so that Q,(x) > f(x). If the involution were assumed to be 
continuous, the set of hermitian elements would be closed, and y* = y 


would follow directly from (4). 
Let R be the radical of A. Let x: A ~ A/R be the quotient map. 
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If x(x) = n(y) and z = x — y, then z € R; hence z* e€ R because 
p(z*) = p(z) = 0 (see Theorem 11.15), and therefore 2(x*) = x(y*). This 
shows that the formula 


(5) [x(a)]* = x(a*) (ae A) 


defines an involution in A/R. If a € A is hermitian, so is x(a). Since x is 
continuous, 2(y,)— 2(y). Since A/R is isomorphic to A (Theorem 
11.9), A/R is semisimple, and therefore every involution in A/R is con- 
tinuous (Theorem 11.16). It follows that z(y) is hermitian. Hence 
my) = n(y*). 

We conclude that y* — y lies in the radical of A. 

By Theorem 11.15, y = u + iv, where u = u* and v = v*. We just 
proved that v € R. Since x = y?, we have 


(6) x =u? — v? + 2iu. 


Let h be any complex homomorphism of A. Since ve R, h(v) = 0. 
Hence h(x) = [h(u)]?. By hypothesis, 0 ¢ o(x). Thus h(x) # 0; hence 
h(u) #0. By Theorem 11.5, u is invertible in A. Since x = x*, (6) 
implies that uv = 0. Since v = u~'(uv), we conclude that v = 0. This 
completes the proof. //// 


Remark. If o(x) < (0, 0), then also a(y) < (0, «). This follows from 
(1) (the definition of y) and the spectral mapping theorem. 


Applications to Noncommutative Algebras 


Noncommutative algebras always contain commutative ones. Their pres- 
ence can sometimes be exploited to extend certain theorems from the com- 
mutative situation to the noncommutative one. On a trivial level, we have 
already done this: In the elementary discussion of spectra, our attention 
was usually fixed on one element x € A; the (closed) subalgebra A, of A 
that x generates is commutative, and much of the discussion took place 
within A,. One possible difficulty was that x might have different spectra 
with respect to A and Aj. There is a simple construction (Theorem 11.22) 
that circumvents this. Another device (Theorem 11.25) can be used when A 
has an involution. 


11.21 Centralizers If S is a subset of a Banach algebra A, the central- 
izer of S is the set 


I(S) = {x € A: xs = sx for every s € S}. 


We say that S commutes if any two elements of S commute with each other. 
We shall use the following simple properties of centralizers. 
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(a) I(S) is a closed subalgebra of A. 
(b) ScT(I(S)). 
(c) If S commutes, then T(I(S)) commutes. 


Indeed, if x and y commute with every s € S, so do Ax, x + y, and xy; 
since multiplication is continuous in A, I'(S) is closed. This proves (a). Since 
every s € S commutes with every x € I(S), (b) holds. If S commutes, then 
S < I(S), hence I'(S) > I'(I'(S)), which proves (c), since I(E) obviously com- 
mutes whenever I'(E) c E. 


11.22 Theorem Suppose A is a Banach algebra, S < A, S commutes, and 
B = I(I(S)). Then B is a commutative Banach algebra, S < B, and o,(x) = 
o ,(x) for every x € B. 


PROOF. Since e € B, Section 11.21 shows that B is a commutative 
Banach algebra that contains S$. Suppose x € B and x 1s invertible 
in A. We have to show that x 'e B. Since x € B, xy = yx for 
every yeI(S); hence y=x "yx, yx"'=x 'y. This says that 
x~' € T(I(S)) = B. //// 


11.23. Theorem Suppose A is a Banach algebra, x € A, y€ A, and 
xy = yx. Then 


o(x + y) < o(x) + a(y) and a(xy) < a(x)a(y). 


PROOF. Put S = {x, y}; put B = I(I(S)). Then x + y € B, xy € B, and 
Theorem 11.22 shows that we have to prove that 


Op(x + y) < O(X) + 9,(y) and O_(XY) C Gp(X)oQ()). 

Since B is commutative, o,(z) is the range of the Gelfand trans- 
form Z, for every z € B. (The Gelfand transforms are now functions on 
the maximal ideal space of B.) Since 

(x+y4=xk+)Y and (xy) = xj, 


we have the desired conclusion. diy 
11.24 Definition Let A be an algebra with an involution. If x € A and 


xx* = x*x, then x is said to be normal. A set S < A is said to be normal if S 
commutes and if x* e S whenever x € S. 


11.25 Theorem Suppose A is a Banach algebra with an involution, and B 
is anormal subset of A that is maximal with respect to being normal. Then 
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(a) Bis aclosed commutative subalgebra of A, and 
(b) o,(x) = 0 ,(x) for every x € B. 


Note that the involution is not assumed to be continuous but that B 
nevertheless turns out to be closed. 


PROOF. We begin with a simple criterion for membership in B: If 
x € A, if xx* = x*x, and if xy = yx for every y € B, then x € B. 

For if x satisfies these conditions, we also have xy* = y*x for all 
y € B, since B is normal, and therefore x*y = yx*. It follows that 
B u {x, x*} is normal. Hence x € B, since B is maximal. 

This criterion makes it clear that sums and products of members 
of B are in B. Thus B is a commutative algebra. 

Suppose x, € B and x, — x. Since x, y = yx, for all y € B, and 
multiplication is continuous, we have xy = yx and therefore also 


x*y = (y*x)* = (xy*)* = yx*. 
In particular, x*x, = x, x* for all n, which leads to x*x = xx*. Hence 
x € B, by the above criterion. This proves that B is closed and com- 
pletes (a). 
Note also that e € B. To prove (b), assume x € B, x ' € A. Since 


x is normal, so is x~*, and since x commutes with every y € B, so does 
x” +. Hence x~/ € B. //// 


Our first application of this is a generalization of Theorem 11.20: 


11.26 Theorem The word “commutative” may be dropped from the 
hypothesis of Theorem 11.20. 


PROOF. By Hausdorff’s maximality theorem, the given hermitian 
(hence normal) x € A lies in some maximal normal set B. By Theorem 
11.25 we can apply Theorem 11.20 with B in place of A. //// 


Our next application of Theorem 11.25 will extend some conse- 


quences of Theorem 11.18 to arbitrary (not necessarily commutative) B*- 
algebras. 


11.27. Definition In a Banach algebra with involution, the statement 
“x > 0” means that x = x* and that o(x) < [0, o). 


11.28 Theorem Every B*-algebra A has the following properties: 


(a) 
(b) 
(c) 
(d) 
(e) 
(f) 
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Hermitian elements have real spectra. 
If x € Ais normal, then p(x) = ||x|l. 


If y € A, then p(yy*) = |ly|l’. 

Ifue A,ve Aju>0,andv>0,thenu+v>0. 
If y € A, then yy* > 0. 

If y € A, then e + yy* is invertible in A. 


PROOF. Every normal x € A lies in a maximal normal set Bc A. By 
Theorems 11.18 and 11.25, B is a commutative B*-algebra which is 
isometrically isomorphic to its Gelfand transform B = C(A) and which 
has the property that 


(1) o(z)=3(A) (ze B). 


Here o(z) is the spectrum of z relative to A, A is the maximal ideal 
space of B, and 2(A) is the range of the Gelfand transform of z, regard- 
ed as an element of B. 

If x = x*, Theorem 11.18 shows that x is a real-valued function 
on A. Hence (1) implies (a). 

For any normal x, (1) implies p(x) = ||X||,,. Also, |X|, = ||xll, 
since B and B are isometric. This proves (b). 

If y € A, then yy* is hermitian. Hence (c) follows from (b), since 
o(yy*) = Ilyy*|| = yl’. 

Suppose now that u and v are as in (d). Put « = |lu||, B = |v], 
w=u+ov0,y=a+ fB. Then o(u) c [0, a], so that 


(2) o(ae — u) c [0, a] 

and (b) implies therefore that ||we — ul| < «. For the same reason, 
|Be — v|| < B. Hence 

(3) lye — wll Sy. 

Since w = w*, (a) implies that o(ye — w) is real. Thus 

(4) a(ye —w) c [—», y], 


because of (3). But (4) implies that o(w) < [0, 2y]. Thus w > 0, and (d) 
is proved. 
We turn to the proof of (e). Put x = yy*. Then x is hermitian, 
and if B is chosen as in the first paragraph of this proof, then x is a 
real-valued function on A. By (1), we have to show that X > 0 on A. 
Since B = C(A), there exists z € B such that 


(5) S=|%|-%& ond. 
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Then z = z*, because 2 is real (Theorem 11.18). Put 

(6) Zy=w=u+iv, 

where u and v are hermitian elements of A. Then 

(7) ww* = zyy*z* = 2xz = 27x 

and therefore 

(8) w*w = 2u? + 20? — ww* = 2u? + 2v? — 27x. 


Since u = u*, o(u) is real, by (a), hence u* > 0, by the spectral mapping 
theorem. Likewise v? > 0. By (5), 27% <0 on A. Since 27x ¢€ B, it 
follows from (1) that —z*x > 0. Now (8) and (d) imply that w*w > 0. 
But o(ww*) < o(w*w) U {0} (Exercise 4, Chapter 10). Hence 
ww* > 0. By (7), this means that 27% >0 on A. By (5), this last 
inequality holds only when x = |X|. Thus X > 0, and (e) is proved. 
Finally, (f) is a corollary of (e). //// 


Equality of spectra can now be proved in yet another situation, in 
which commutativity plays no role. 


11.29 Theorem Suppose A is a B*-algebra, B is a closed subalgebra of 
A, e € B, and x* € B for every x € B. Thena ,(x) = o,(x) for every x € B. 


PROOF. Suppose x € B and x has an inverse in A. We have to show 
that x~+ e B. Since x is invertible in A, so is x*, hence also xx*, and 
therefore 0 ¢ a ,(xx*). By (a) of Theorem 11.28, 0 ,(xx*) < (— ©, oo), 
so that o,(xx*) has a connected complement in ¢. Theorem 10.18 
shows now that o,(xx*) = o ,(xx*). Hence (xx*)~' € B, and finally 
x” > = x*(xx*)' © B. //// 


Positive Functionals 


11.30 Definition A positive functional is a linear functional F on a 
Banach algebra A with an involution, that satisfies 


F(xx*) > 0 


for every x € A. Note that A is not assumed to be commutative and that 
continuity of F is not postulated. (The meaning of the term “positive” 
depends of course on the particular involution that is under consideration.) 


11.31 Theorem Every positive functional F on a Banach algebra A with 
involution has the following properties: 


(a) 
(b) 
(c) 
(d) 
(e) 
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F(x*) = F(x). 

| F(xy*) |? < F(xx*)F(yy*). 

| F(x) |? < F(e)F(xx*) < Fle)? p(xx*). 

| F(x)| < F(e)p(x) for every normal x € A. 


F is a bounded linear functional on A. Moreover, ||F\|| = F(e) if A is 
commutative, and ||F || < B'/?F(e) if the involution satisfies |\x*|| < B\|x|| 
for everyx € A. 


PROOF. If x € A and y € A, put 
(1) p=F(xx*), q=Flyy*) r=F(xy*)  s= F(yx*). 
Since F[(x + ay)(x* + ay*)] > 0 for every ae , 
(2) ptor+as+|al7q>0. 
With « = 1 and « = i, (2) shows that s = r and i(s — r) are real. Hence 
s =r. With y = e, this gives (a). 

If r = 0, (b) is obvious. If r # 0, take « = tr/|r| in (2), where t 1s 
real. Then (2) becomes 
(3) pt+2|rjt+qt?>0 (—-w<t<o), 


so that |r|* < pq. This proves (b). 

Since ee* = e, the first half of (c) is a special case of (b). For the 
second half, pick t > p(xx*). Then o(te — xx*) lies in the open right 
half-plane. By Theorem 11.26, there exists u e€ A, with u = u*, such 
that u* = te — xx*. Hence 


(4) tF(e) — F(xx*) = F(u*) > 0. 
It follows that 
(5) F(xx*) < F(e)p(xx*). 


This completes part (c). 
If x is normal, ie., if xx* = x*x, Theorem 11.23 implies that 
o(xx*) < o(x)o(x*), so that 


(6) p(xx*) < p(x)p(x*) = p(x)’. 


Clearly, (d) follows from (6) and (c). 

If A is commutative, then (d) holds for every x € A, so that 
Fl = Fle). If |x*|| < Bilx|l, (©) implies | F(x)| < F(e)p"*||x\|, since 
p(xx*) < ||x||||x*||. This disposes of the special cases of part (e). 

Before turning to the general case, we observe that F(e) > 0 and 
that F(x) = 0 for every x € A if F(e) = 0; this follows from (c). In the 
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remainder of this proof we shall therefore assume, without loss of gen- 
erality, that 


(7) F(e) = 1. 


Let H be the closure of H, the set of all hermitian elements of A. 
Note that H and iH are real vector spaces and that A = H + iH, by 
Theorem 11.15. By (d), the restriction of F to H is a real-linear func- 
tional of norm 1, which therefore extends to a real-linear functional ® 
on H, also of norm 1. We claim that 


(8) Oy)=0 £ifyeH nid, 


for if y = lim u, = lim (iv,), where u, € H and v, € H, then u? > y?, 
v2 — — y’, so that (c) and (d) imply 


(9) | F(u,) |? < Flug) < F(uz + vf) < |lug + v7 || > 0. 


Since ®(y) = lim F(u,), (8) is proved. 

By Theorem 5.20, there is a constant y < oo such that every 
x € A has a representation 
(10) x=x,+ix,, x,€H, x, eH, xl] + llx2ll < vill. 


If x=u+iv, with we H, ve dH, then x, —u and x,—v he in 
H oc i. Hence (8) yields 


(11) F(x) = F(u) + iF(v) = O(x,) + iD(x,), 
so that 
(12) | F(x) | < | Bx1)| + | 2) | < [all + lxall < vile. 
This completes the proof. //// 


Exercise 13 contains further information about part (e). 
Examples of positive functionals—and a relation between them and 


positive measures—are furnished by the next theorem. It contains 
Bochner’s classical theorem about positive-definite functions as a very 
special case. The identifications that lead from one to the other are indi- 
cated in Exercise 14. 


11.32 Theorem Suppose A is a commutative Banach algebra, with 
maximal ideal space A, and with an involution that is symmetric in the sense 


that 


(1) 


h(x*) = h(x) (x € A, he A). 


Let K be the set of all positive functionals F on A that satisfy F(e) < 1. 


CHAPTER 11: COMMUTATIVE BANACH ALGEBRAS 299 


Let M be the set of all positive regular Borel measures yp on A that satisfy 
u(A) < 1. Then the formula 


(2) F(x) = [3 du (x € A) 


establishes a one-to-one correspondence between the convex sets K and M, 
which carries extreme points to extreme points. 

Consequently, the multiplicative linear functionals on A are precisely the 
extreme points of K. 


PROOF. If » € M and F is defined by (2), then F is obviously linear, 
and F(xx*) = J |X|? du=0, because (1) implies that (xx*)* = |%|?. 
Since F(e) = p(A), F e€ K. 

If F € K, then F vanishes on the radical of A, by (d) of Theorem 
11.31. Hence there is a functional F on A that satisfies F(X) = F(x) for 
all x € A. In fact, 


(3) | F(&)| = | FO) | < Fe)p(x) = FN = ( € A). 


by (d) of Theorem 11.31. It follows that F is a linear functional of 
norm F(e) on the subspace A of C(A). This extends to a functional on 
C(A), with the same norm, and now the Riesz representation theorem 
furnishes a regular Borel measure p, with |||| = F(e), that satisfies (2). 
Since 


(4) py(A) = [a du = Fle) = |ull, 


we see that wp > 0. Thus we M. 

By (1), A satisfies the hypotheses of the Stone-Weierstrass 
theorem and is therefore dense in C(A). This implies that p is uniquely 
determined by F. 

One extreme point of M is 0; the others are unit masses concen- 
trated at points h € A. Since every complex homomorphism of A has 
the form x > x(h), for some h é€ A, the proof is complete. //// 


We conclude by showing that the extreme points of K are multiplica- 
tive even if (1) is not satisfied. 


11.33 Theorem Let K be the set of all positive functionals F on a com- 
mutative Banach algebra A with an involution that satisfy F(e) < 1. If F € K, 
then each of the following three properties implies the other two: 


(a) F(xy) = F(x)F(y) for all x and y €é€ A. 
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(b) F(xx*) = F(x)F(x*) for every x € A. 
(c) F is an extreme point of K. 


PROOF. It is trivial that (a) implies (b). Suppose (b) holds. With x = e, 
(b) shows that F(e) = F(e)?, and so F(e)=0 or F(e)=1. When 
F(e) = 0, then F = 0, by (c) of Theorem 11.31, and so F is an extreme 
point of K. Assume F(e) = 1, and 2F = F, + F,, F, € K, F, € K. We 
have to show that F, = F. Clearly, F,(e) = 1 = F(e). If x € A is such 
that F(x) = 0, then 


(1) | F(x) |? < Fy(xx*) < 2F(xx*) = 2F(x)F(x*) = 0, 


by (b) and Theorem 11.31. Thus F, coincides with F on the null space 
of F and at e. It follows that F, = F. Hence (b) implies (c). 

To show that (c) implies (a), let F be an extreme point of K. 
Either F(e) = 0, in which case there is nothing to prove, or F(e) = 1. 
We shall first prove a special case of (a), namely, 


(2) F(xx*y) = F(xx*)F(y) (x € A, y € A). 


Choose x so that ||xx*|| < 1. By Theorem 11.20, there exists z € A, 
z = z*, such that z* = e — xx*. Define 


(3) Dy) = F(xx*y) — (y € A). 

Then 

(4) D(yy*) = F(xx*yy*) = FL(xy\xy)*] = 9, 

and also 

(5) (F — ®)(yy*) = F[(e — xx*)yy*] = F(z’yy*) = FU(yz)(yz)*] = 0. 
Since 

(6) 0 < Me) = F(xx*) < F(e)||xx*|| < 1, 


(4) and (5) show that both ® and F — @ are in K. If ®(e) = 0, then 
® = 0. If D(e) > 0, (6) shows that 


@ F—® 
(7) F = Oe): —_ + (F — ®Xe) - —~-___, 
P(e) F(e) — ®e) 
a convex combination of members of K. Since F is extreme, we con- 
clude that 
(8) ® = O(e)F. 


Now (2) follows from (8) and (3). 
Finally, the passage from (2) to (a) is accomplished by any of the 
following identities, which are satisfied by every involution: 
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If n= 3, 4, 5, ..., if wo =exp (2zi/n), if xe A, and if z,= 
e+q@ x, then 


(9) a 


sie 


n 

Py ot 
» OF 
p=1 


The proof of (9) is a straightforward computation which uses the 
fact that 


(10) 2», w? = p29 wm’? = 0. //// 


Exercises 


1. 


Prove Proposition 11.2. 


2. State and prove an analogue of Wiener’s lemma 11.6 for power series that con- 


verge absolutely on the closed unit disc. 


. If X is a compact Hausdorff space, show that there is a natural one-to-one 


correspondence between closed subsets of X and closed ideals of C(X). 


. Prove that the polynomials are dense in the polydisc algebra A(U"). (See 


Theorem 11.7.) Suggestion: If fe A(U"), 0<r< 1, and f, is defined by f(z) = 
f(rz), then f, is the sum of an absolutely (hence uniformly) convergent multiple 
power series on U". 


. Suppose A is a commutative Banach algebra, x € A, and f is holomorphic in 


some open set 2 c ¢ that contains the range of Xx. Prove that there exists y € A 
such that ) = fo x, that is, such that h(y) = f(h(x)) for every complex homo- 
morphism h of A. Prove that y is uniquely determined by x and f if A is semi- 
simple. 


. Suppose A and B are commutative Banach algebras, B is semisimple, y: A ~ B 


is a homomorphism whose range is dense in B, and a: A, > A,, 1s defined by 
(ah\x) = h(W(x)) = (xe A, he Ap). 


Prove that a is a homeomorphism of A, onto a compact subset of A,. [The fact 
that y(A) is dense in B implies that a is one-to-one and that the topology of A, 
is the weak topology induced by the Gelfand transforms of the elements w(x), 
for x € A.] 

Let A be the disc algebra, let B = C(K), where K is an arc in the unit disc, 
and let Ww be the restriction mapping of A into B. This example shows that a(A,) 
may be a proper subset of A,, even if w is one-to-one. 

Find an example in which f(A) = B but a(A,) # A,. 


. In Example 11.13(b) it was asserted that A # C(A). Find several proofs of this. 
. Which properties of Lebesgue measure are used in Example 11.13(f)? Can 


Lebesgue measure be replaced by any positive measure, without changing any 
of the results? 
Supply the details for the last paragraph in Example 11.13(/). 


. Let C’ be the algebra of all continuously differentiable complex functions on the 


unit interval [0, 1], with pointwise multiplication, normed by 
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10. 


11. 


12. 


13. 


IF = WF lo + WF Mo: 


(a) Show that C’ is a semisimple commutative Banach algebra. Find its 
maximal ideal space. 

(b) Fix p, 0 < p < 1; let J be the set of all fe C’ for which f(p) = f'(p) = 0. Show 
that J is a closed ideal in C’ and that C’/J is a two-dimensional algebra 
which has a one-dimensional radical. (This gives an example of a semisimple 
algebra with a quotient algebra that is not semisimple.) To which of the two 
algebras described in Exercise 5 of Chapter 10 is C’/J isomorphic? 

Let A be the disc algebra. Associate to each fe A a function f* € A by the 

formula 


f*(z) =f (2). 
Then f— f* is an involution on A. 
(a) Does this involution turn A into a B*-algebra? 
(b) Does o( ff *) always lie in the real axis? 
(c) Which complex homomorphisms of A are positive functionals, with respect 
to this involution? 
(d) If wis a positive finite Borel measure on [ — 1, 1], then 


i= | f(t) du(t) 


is a positive functional on A. Are there any others? 
Show that commuting idempotents have distance >1. Explicitly, if x? = x, 
y* = y, xy = yx for some x and y in a Banach algebra, then either x = y or 
|x — yl] => 1. Show that this may fail if xy # yx. 


If xy = yx for some x and y in a Banach algebra, prove that 
p(xy) <= p(x)p(y) and = p(x + y) S p(x) + ply). 
Let t be a large positive number, and define a norm on ¢” by 
wll = [wl + tlw if w = (Wy, Wo). 


Let A be the algebra of all complex 2-by-2 matrices, with the corresponding 
operator norm: 


llyll = max {lly(w)|]: wll =1} (ye A). 


For ye A, let y* be the conjugate transpose of y. Consider a fixed x eé A, 
namely, 


Prove the following statements. 

(a) ||x(w)|| = ¢||w]]; hence ||x|] = ¢. 
(b) o(x) = {t, —t} = o(x*). 

(c) o(xx*) = (1, t*} = o(x*x). 

(d) ox + x*)= {14 07, -1-27}. 
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(e) Therefore commutativity is required in Theorem 11.23 and in Exercise 12. 

(f) If F(y) is the sum of the four entries in y, for y e A, then F is a positive 
functional on A. 

(g) The equality ||F'|| = F(e) [see (e) of Theorem 11.31] does not hold, because 
F(e) = 2 and F(x) = 1 + t?, so that ||F'l| > t. 

(h) If K is the set of all positive functionals f on A that satisfy f(e) < 1 (as in 
Theorem 11.33), then K has many extreme points, although 0 is the only 
multiplicative linear functional on A. Commutativity is therefore required in 
the implication (c) — (a) of Theorem 11.33. 

14. A complex function ¢, defined on R’, is said to be positive-definite if 


Z c,¢; A(x; — x) = 0 


1 


° 


tJ 


for every choice of x,,..., x,, in R” and for every choice of complex numbers 

Cite. 

(a) Show that | d(x)| < $(0) for every x € R". 

(b) Show that the Fourier transform of every finite positive Borel measure on R" 
is positive-definite. 

(c) Complete the following outline of the converse of (b) (Bochner’s theorem): If 
@ is continuous and positive-definite, then ¢ is the Fourier transform of a finite 
positive Borel measure. 

Let A be the convolution algebra L'(R"), with a unit attached, as 
described in (d) of Section 10.3 and (e) of Section 11.13. Define f(x) = f(—x). 
Show that 


f+ad>f + 26 


is an involution on A and that 


f+ a6 ~| Sd dm,, + «9(0) 


Is a positive functional on A. By Theorem 11.32 and (e) of Section 11.13, 
there is a positive measure p on the one-point compactification A of R", such 
that 


| fo dm, + 26(0)= | (f+ a) dy. 


If o is the restriction of yu to R", it follows that 


| sodm,=| fide 


for every f € L'(R"). Hence ¢ = 6G. (Actually, u is already concentrated on R", 
so that o = p.) 

(d) Let P be the set of all continuous positive-definite functions @ on R” that 
satisfy ¢(0) < 1. Find all extreme points of this convex set. 


15. Let A be the maximal ideal space of a commutative Banach algebra A. Call a 
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16. 


17. 
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closed set B <A an A-boundary if the maximum of |x| on A equals its 
maximum on f, for every x € A. (Trivially, A is an A-boundary.) 

Prove that the intersection 0, of all A-boundaries is an A-boundary. 

0, is called the Shilov boundary of A. The terminology is suggested by the 
maximum modulus property of holomorphic functions. For instance, when A is 
the disc algebra, then ¢, is the unit circle, which is the topological boundary of 
A, the closed unit disc. 

Outline of proof: Show first that there is an A-boundary f, which is 
minimal in the sense that no proper subset of B, is an A-boundary. (Partially 
order the collection of A-boundaries by set inclusion, etc.) Then pick hy € Bo, 
pick x,,..., xX, € A with x,(h,) = 0, and put 


V = {he A: |X{h)| < 1 for 1 <i <n}. 


Since f, is minimal, there exists x € A with ||X||,, = 1 and | X(h)| < 1 on By — V. 

If y=x™ and m is sufficiently large, then |X; )| <1 on By, for all i. Hence 

|X; Sil... < 1. Conclude from this first that | }(h)| = || ||,, only in V, hence that 

V intersects every A-boundary 8, and finally that h, e 8B. Thus B, < B, and 
0 = 94. 

Suppose A is a Banach algebra, m is an integer, m > 2, K < oo, and 


|x|" < K]lx™| 


for every x € A. Show that there exist constants K, < oo, for n= 1, 2, 3,..., 
such that 


|x" < K,|x"l] (x € A). 


(This extends Theorem 11.12.) 
Suppose {@,,} (— 00 <n < oo) are positive numbers such that w, = 1 and 


Om+n = Om Vn 


for all integers m and n. Let A = A{q,} be the set of all complex functions f on 
the integers for which the norm 


Ifl= DL IfMle, 
is finite. Define multiplication in A by 


(f*gXn)= d f(n— kyg(k). 
k=—o 

(a) Show that each A{w,} is a commutative Banach algebra. 

(b) Show that R, = lim,_,,, (@,)'” exists and is finite, by showing that R, = 
inf, 50 (,)*". 

(c) Show similarly that R_ = lim,.,,, (w_,)~ ' exists and that R_ < R,. 

(d) Put A= {Ae ¢: R_ <|A| < R,}. Show that A can be identified with the 
maximal ideal space of A{w,} and that the Gelfand transforms are abso- 
lutely convergent Laurent series on A. 


18. 


19. 


20. 


21. 


22. 
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(e) Consider the following choices for {w,}: 
(i) w, = 1. 
(ii) w, = 2". 
(iii) w, = 2° ifn > 0,0, = lifn<0O. 
(iv) w, = 14 2n?. 
(v) w, = 14 2n7? ifn >0,0, =1ifn <0. 
For which of these is A a circle? For which choices is A{w,} self-adjoint, in 
the sense that A is closed under complex conjugation? 
(f) Is A{w,} always semisimple? 
(g) Is there an A{w,}, with A the unit circle, such that A consists entirely of 
infinitely differentiable functions? 
Let A be the maximal ideal space of L”(m), as in Section 11.13. Show that 
(a) A has no isolated point, and 
(b) A contains no convergent sequence of distinct points. Hint: If p,, p., p3,..-, 
are distinct points in A, none of which is a limit point of the others, and if 
{w,} is any bounded sequence of numbers, then there exist pairwise disjoint 
open sets V; in A such that p,; € V,;, and there exists a function @ € C(A) such 
that = w; on V;,. 
Let L°(m) be as above and prove: If f, € L*(m) and f, > 0 in the weak topology 
of L*(m), then {} | f,|? dm—0 for every p € (0, 00). Show, by constructing an 
example, that the converse is false. 
Prove the following partial converse of Theorem 11.31: If F is a bounded linear 
functional on a B*-algebra A, and ||F|| = F(0) = 1, then F is positive. 
Suggestion: Choose x € A, ||x|| < 1, put F(xx*) = a + fi, 


y, = xx* — (4 + ite, 


for —co <t < oo. Use Theorem 11.28 to show that o(xx*) < [0,1] and that 
therefore 


LF(y) 1 < yl = e(y) < 14 + itl. 


Proceed as in Lemma 5.26. 

In €’, let K, consist of all points (e, e~"*), and let K, consist of the points 
(e’*, e'”), 0 < 6 < 2x. Of these circles, show that K, is polynomially convex but 
K, is not. How about K, = {(cos 6,sin 0): 0 < 6 < 22}? 


Show that a 3-by-3 matrix M commutes with 
0 0 1 ax y 
0 0 QO} ifandonlyif M=/0 z wif. 
0 0 0 0 0a 


Deduce from this that centralizers (see Section 11.21) need not be commutative. 


CHAPTER 


BOUNDED 
OPERATORS ON 
A HILBERT 
SPACE 


Basic Facts 


12.1 Definitions A complex vector space H is called an inner product 
space (or unitary space) if to each ordered pair of vectors x and y in H is 
associated a complex number (x, y), called the inner product or scalar 
product of x and y, such that the following rules hold: 


(a) (y, x) =(x, y). (The bar denotes complex conjugation.) 
(b) (x + y, 2) = (x, z) + (y 2). 

(c) (ax, y)=a(x,y)ifxe H,ye Hae ©. 

(d) (x, x) => 0 for all x € H. 

(e) (x, x) =0onlyifx =0. 


For fixed y, (x, y) is therefore a linear function of x. For fixed x, it is a 
conjugate-linear function of y. Such functions of two variables are some- 
times called sesquilinear. 

If (x, y) = 0, x is said to be orthogonal to y, and the notation x | y is 
sometimes used. Since (x, y) = 0 implies (y, x) = 0, the relation | is sym- 
metric. If E -c H and F CH, the notation E | F means that x | y when- 
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ever x € E and ye F. Also, E° is the set of all y e H that are orthogonal to 
every x € E. 
Every inner product space can be normed by defining 


xl] = Ge, x)”. 


Theorem 12.2 implies this. If the resulting normed space is complete, it is 
called a Hilbert space. 


12.2 Theorem If x ¢ H and y eH, where H is an inner product space, 
then 


(1) I(x, y)1 < Hilly 

and 

(2) Ix + yl] < [xl] + lly. 
Moreover 

(3) ly] < Ax +yl| for everyAe¢ 


if and only if x 1 y. 


PROOF. Put a = (x, y). A simple computation gives 
(4) 0 < |JAx + yl]? =|A[? |x|]? + 2 Re (A) + Ilyll?. 


Hence (3) holds if « = 0. If x = 0, (1) and (3) are obvious. If x 4 0, take 
A = —/||x||*. With this A, (4) becomes 


| «|? 


(5) 0s x + yl? = lly? —o. 


This proves (1) and shows that (3) is false when a 4 0. By squaring 
both sides of (2), one sees that (2) is a consequence of (1). jiiy 


Note: Unless the contrary is explicitly stated, the letter H will from 
now on denote a Hilbert space. 


12.3. Theorem Every nonempty closed convex set E<H contains a 
unique x of minimal norm. 
PROOF. The parallelogram law 
(1) x + yl]? + Ix — yl]? = 2))x|]? + 2ilyll? = (we H, ye A) 
follows directly from the definition ||x||? = (x, x). Put 


(2) d = inf {||x||: x € E}. 
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Choose x, € E so that ||x,|| > d. Since 4(x, + x,,) € E, |x, + X,{|7 = 
4d*. If x and y are replaced by x, and x,, in (1), the right side of (1) 
tends to 4d”. Hence (1) implies that {x,} is a Cauchy sequence in H, 
which therefore converges to some x € E, with ||x|| = d. 

If y e E and ||y|| = d, the sequence {x, y, x, y, ...} must converge, 
as we Just saw. Hence y = x. ITT] 


12.4 Theorem If M is aclosed subspace of H, then 
H=M@M-. 


The conclusion is, more explicitly, that M and M? are closed subspaces of 
H whose intersection is {0} and whose sum is H. The space M°~ is called the 
orthogonal complement of M. 


PROOF. If E < H, the linearity of (x, y) as a function of x shows that 
E+ is a subspace of H, and the Schwarz inequality (1) of Theorem 12.2 
implies then that E+ is closed. 

If xe M and xe M-, then (x,x)=0; hence x =0. Thus 
M m M~* = {0}. 

If x €e H, apply Theorem 12.3 to the set x — M to conclude that 
there exists x, € M that minimizes ||x — x,||. Put x, = x — x,. Then 
|x|] < llx, + yl] for all ye M. Hence x, ¢€M", by Theorem 12.2. 
Since x = x, + x, we have shown that M + M+ = H. //// 
Corollary. If M is a closed subspace of H, then 


(M+)+ = M. 
PROOF. The inclusion M c (M*)* is obvious. Since 
M @ M+ = H = M' @(M?)', 
M cannot be a proper subspace of (M*)”. //// 


We now describe the dual space H* of H. 
12.5 Theorem There is a conjugate-linear isometry y > A of H onto H*, 
given by 
(1) Ax = (x, y) (x € A). 
PROOF. If y e H and A is defined by (1), the Schwarz inequality (1) of 
Theorem 12.2 shows that A € H* and that |/A|| < || y|]. Since 


(2) lvl? =(, y) = Ay < WALI, 
it follows that ||Al| = lly]. 
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It remains to be shown that every A e€ H* has the form (1). 
If A = 0, take y = 0. If A #0, let (A) be the null space of A. 
By Theorem 12.4 there exists z € W(A)", z ¥ 0. Since 


(3) (Ax)z — (Az)x € WY(A) (x € H), 
it follows that (Ax)(z, z) —(Az)(x, z) = 0. Hence (1) holds with y = 
(z, z)~1(Az)z. //// 


12.6 Theorem If {x,' is a sequence of pairwise orthogonal vectors in H, 
then each of the following three statements implies the other two. 


[e.6) 


(a) > x, converges, in the norm topology of H. 


n=1 


b) 3 [x,[2 < 00. 


n=1 
foe) 


(c) > (x,, y) converges, for every y € H. 


n=1 


Thus strong convergence (a) and weak convergence (c) are equivalent 
for series of orthogonal vectors. 

PROOF. Since (x;, x;) = 0 if i # j, the equality 

(1) ge a ep = lal ee eal 


holds whenever n < m. Hence (b) implies that the partial sums of )° x, 
form a Cauchy sequence in H. Since H is complete, (b) implies (a). The 
Schwarz inequality shows that (a) implies (c). Finally, assume that (c) 
holds. Define A, € H* by 


(2) A,y = > (y, x) (y € H, n = 1, 2, 3,...). 
i=1 
By (c), {A, y} converges for every y € H; hence {||A,||} is bounded, by 
the Banach-Steinhaus theorem. But 
(3) WA ll = locy +o + ql = {Mea l? + + fen? B PE. 
Hence (c) implies (b). Hit 


Bounded Operators 


In conformity with notations used earlier, @(H) will now denote the Banach 
algebra of all bounded linear operators T on a Hilbert space H # {0}, 
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normed by 
| T| = sup {|| Tx||: x € H, |x|] < 1}. 


We shall see that @(H) has an involution which makes it into a B*-algebra. 
We begin with a simple but useful uniqueness theorem. 


12.7 Theorem If T € &A) and if (Tx, x)=0 for every x € H, then 
r= 0; 


PROOF. Since (T(x + y), x + y) = 0, we see that 


(1) (Tx, y)+ (Ty, x) =90 (xe H, ye #). 

If y is replaced by iy in (1), the result is 

(2) —i(Tx, y)+ i(Ty, x) =0 (xe H, ye H). 

Multiply (2) by i and add to (1), to obtain 

(3) (Tx, y)=0 (xe H, ye A). 

With y = Tx, (3) gives || Tx||? = 0. Hence Tx = 0. //// 


Corollary. If S « BH), T € BA), and 
(Sx, x) = (Tx, x) 
for every x € H, then S = T. 
PROOF. Apply the theorem to S — T. /T// 


Note that Theorem 12.7 would fail if the scalar field were R. To see 
this, consider rotations in R?. 


12.8 Theorem I[ff: H x H—{ is sesquilinear and bounded, in the sense 


that 

(1) M = sup {| f(x, y)I: IIx = llyll = 1} < 0, 
then there exists a unique S € &(H) that satisfies 

(2) f(x, y) = (x, Sy) (x € H, ye H). 


Moreover, ||S|| = M. 
PROOF. Since | f(x, y)| < M||x|||lyll, the mapping 
x > f(x, y) 


is, for each y € H, a bounded linear functional on H, of norm at most 
M|ly||. It now follows from Theorem 12.5 that to each y € H corre- 
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sponds a unique element SyeH such that (2) holds; also, 
|Sy|| < M|ly||. It is clear that S: H — H is additive. If « e ¢, then 


(x, S(ay)) = f(x, ay) = af (x, y) = a(x, Sy) = (x, aSy) 


for all x and y in H. It follows that S is linear. Hence S € &(H), and 
[S|] < M. 
But we also have 


I F(x, vy] = 1, Sy) < PxIMSyll s [lx SH y I, 
which gives the opposite inequality M < ||S||. //// 


12.9 Adjoints If T ¢ AH), then (Tx, y) is linear in x, conjugate-linear 
in y, and bounded. Theorem 12.8 shows therefore that there exists a unique 
T* € &(A) for which 


(1) (Tx, y)=(x, T*y) (xe H, ye A) 
and also that 
(2) | 7* || = | TI. 


We claim that T > T* is an involution on &(H), that is, that the 
following four properties hold: 


(3) (T + S)* = T* + S*. 
(4) (aT)* = aT*. 

(5) (ST)* = T*S*. 
(6) T** = T. 


Of these, (3) is obvious. The computations 
(aT x, y) = aT x, y) = a(x, T*y) = (x, aT *y), 
(ST x, y) = (Tx, S*y) = (x, T*S*y), 
(Tx, y) = (T*y, x) = (y, T**x) = (T**x, y) 
give (4), (5), and (6). Since 
|| Tx||* = (Tx, Tx) = (T*Tx, x) < |T*T|I||x\I? 
for every x € H, we have ||T ||? < || T*T'||. On the other hand, (2) gives 
IT*T | < TANT = ITI? 
Hence the equality 
(7) |T*T|| = | TI? 


holds for every T € @(H). 
We have thus proved that @(H) is a B*-algebra, relative to the involu- 
tion T + T* defined by (1). 
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Note: In the preceding setting, T* is sometimes called the Hilbert 
space adjoint of T, to distinguish it from the Banach space adjoint that was 
discussed in Chapter 4. The only difference between the two is that in the 
Hilbert space setting T — T* is conjugate-linear instead of linear. This is 
due to the conjugate-linear nature of the isometry described in Theorem 
12.5. If T* were regarded as an operator on H* rather than on H, we would 
be exactly in the situation of Chapter 4. 


12.10 Theorem If T &€ &(H), then 
N(T*)=AT)* and W(T) = AT*)*. 


We recall that (T) and &(T) denote the null space and range of T, 
respectively. 


PROOF. Each of the following four statements is clearly equivalent to 
the one that follows and/or precedes it. 


(1) T*y =0. 
(2) (x, T*y) = 0 for every x € H. 
(3) (Tx, y) =0 for every x € H. 
(4) ye AT). 
Thus (T*) = A&(T)*. Since T** = T, the second assertion 
follows from the first if T is replaced by T*. ay 


12.11 Definition An operator T € &(H) is said to be 


(a) normalif TT* = 1*T, 

(b) self-adjoint (or hermitian) if T* = T, 

(c) unitary if T*T = I = TT*, where J is the identity operator on H, 
(d) a projection if T* = T. 


It is clear that self-adjoint operators and unitary operators are 
normal. Most of the theorems obtained in this chapter will be about normal 
operators. 

This algebraic requirement, namely, that T should commute with its 
adjoint, has remarkably strong analytic and geometric consequences. 


12.12 Theorem An operator T € &(H) is normal if and only if 
(1) || Tx|| = | T*x|| 


for every x € H. Normal operators T have the following properties: 
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(a) NW(T)= V(T*). 

(b) &(T) is dense in H if and only if T is one-to-one. 

(c) T is invertible if and only if there exists 6 > 0 such that ||Tx|| > 6||x|| 
for every x € H. 

(d) If Tx = ax for some x € H,a € €, then T*x = ax. 

(e) If a and B are distinct eigenvalues of T, then the corresponding eigen- 
spaces are orthogonal to each other. 


PROOF. The equalities 
|| Tx||? = (Tx, Tx) = (T*Tx, x), 
| T*x||? = (T*x, T*x) = (TT*x, x), 

combined with the corollary to Theorem 12.7, prove the first state- 
ment, and (a) is an immediate consequence. Since &(T)~ = WV(T*), (a) 
implies (b). If there is a 6 >0 as in (c), then A(T) is closed, by 
Theorem 1.26, and is dense, by (b); hence @(T) = H and T is invert- 
ible. The converse follows from the open mapping theorem. To obtain 


(d), apply (a) to T—al in place of T. Finally, if Tx =ax and 
Ty = By, then (d) shows that 


a(x, y) = (ax, y) = (Tx, y) = (x, T*y) = (x, By) = Blx, y). 
Since a # B, we conclude that x | y. //// 


12.13 Theorem If U € &(H), the following three statements are equiva- 
lent. 


(a) U is unitary. 
(b) AU) = H and (Ux, Uy) = (x, y) for allx € H, y € H. 
(c) AU) =H and ||Ux|| = ||x|| for every x € H. 


PROOF. If U is unitary, then @(U)=H because UU* =]. Also, 
U*U = I, so that 


(Ux, Uy) = (x, U*Uy) = (x, y). 
Thus (a) implies (b). It is obvious that (b) implies (c). If (c) holds, then 
(U*Ux, x) = (Ux, Ux) = ||Ux||? = ||x|l* = &, x) 


for every x € H, so that U*U =I. But (c) implies also that U is a 
linear isometry of H onto H, so that U is invertible in @(H). Since 
U*U = 1, U~* = U*, and therefore U is unitary. //// 


314 part II: BANACH ALGEBRAS AND SPECTRAL THEORY 


Note: The equivalence of (a) and (b) shows that the unitary operators 
are precisely those linear isomorphisms of H that also preserve the inner 
product. They are therefore the Hilbert space automorphisms. 

The equivalence of (b) and (c) is also a corollary of Exercise 2. 

The preceding proof shows that an operator T € &(H) is an isometry 
(i.e., satisfies ||Tx|| = ||x|| for every x e€ H) if and only if T*T =I. This is 
one half of what is needed to be unitary, but it is not enough. For example, 
let T be the right shift S, on ¢” (see Exercise 2, Chapter 10) whose adjoint is 
easily seen to be S,. 


12.14 Theorem Each of the following four properties of a projection 
P € &H) implies the other three: 


(a) P is self-adjoint. 

(b) P is normal. 

(c) RP) = (PY. 

(d) (Px, x) = ||Px||* for every x € H. 


Moreover, two self-adjoint projections P and Q have &@P) | AQ) if 
and only if PQ = 0. 


Property (c) is usually expressed by saying that P is an orthogonal 
projection. 


PROOF. It is trivial that (a) implies (b). Statement (a) of Theorem 12.12 
shows that WV(P)= A&(P) if P is normal; since P is a projection, 
A(P) = V(I — P), so that A(P) is closed. It now follows from the 
corollary to Theorem 12.4 that (6) implies (c). 
If (c) holds, every x e H has the form x = y +z, with y 1 z, 
Py = 0, Pz = z. Hence Px = z, and (Px, x) = (z, z). This proves (d). 
Finally, assume (d) holds. Then 


|| Px||* = (Px, x) = (x, P*x) = (P*x, x). 
The last equality holds because || Px||? is real and (x, P*x) = ||Px|l*. 
Thus (Px, x) = (P*x, x), for every x € H, so that P = P*, by Theorem 
12.7. Hence (d) implies (a). 
The identity (Px, Oy) = (x, PQy) proves the last assertion. —//// 


12.15 Theorem 
(a) If U is unitary and 4 € a(U), then |A| = 1. 
(b) If S is self-adjoint and 4 € a(S), then A is a real number. 
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PROOF. (a) Theorem 12.13 shows that ||U|| = 1, and therefore |A| < 1 
if A € o(U). On the other hand, if || < 1, then ||AU*|| < 1; hence 


AI — U = —U(I — AU*) 


is invertible in @(H) (Theorem 10.7), and therefore 4 ¢ o(U). 
(b) Suppose S=S*, A=a+ifeo(S). Put S,=S—dAl. A 
simple calculation gives 


|S, |]? = Sx — axl]? + B*Ixl?, 


so that ||S,x|| >| B||lx\|. If B 4 0, it follows that S, is invertible [by (c) 
of Theorem 12.12], and thus 4 ¢ a(S). //// 


A Commutativity Theorem 


Let x and y be commuting elements in some Banach algebra with an invo- 
lution. It is then obvious that x* and y* commute, simply because 
x*y* = (yx)*. Does it follow that x commutes with y*? Of course, the 
answer is negative whenever x is not normal and y = x. But it can be nega- 
tive even when both x and y are normal (Exercise 28). It is therefore an 
interesting fact that the answer is affirmative (if x is normal) in @(H), rela- 
tive to the involution furnished by the Hilbert space adjoint: 


If Ne @A) is normal, if Te AH), and if NT =TN, then 
Nt SIN *, 


In fact, a more general result is true: 


12.16 Theorem (Fuglede-Putnam-Rosenblum) Assume that M, N, 
T € BA), M and N are normal, and 


(1) MT =TN. 
Then M*T = TN*. 


PROOF. Suppose first that S e @(H). Put V = S — S*, and define 
(2) Q =exp (V) = x (= Jv 

Then V* = — V, and therefore 

(3) Q* = exp (V*) = exp(—V) =Q. 

Hence Q is unitary. The consequence we need is that 


(4) lexp (S — S*)|| = 1 for every S € A(A). 


316 PART III: BANACH ALGEBRAS AND SPECTRAL THEORY 


If (1) holds, then M*T = TN* for k = 1, 2, 3, ..., by induction. 


Hence 

(5) exp (M)T = T exp (N), 
or 

(6) T = exp (—M)T exp (N). 


Put U, = exp (M* — M), U, =exp (N — N*). Since M and N are 
normal, it follows from (6) that 


(7) exp (M*)T exp (—N*)=U,TU,. 
By (4), ||U, || = ||U2|| = 1, so that (7) implies 
(8) llexp (M*)T exp (—N*)|| < |T]. 


We now define 
(9) f(A) = exp (AM*)T exp (—AN*) (Ae C). 


The hypotheses of the theorem hold with AM and AN in place of M 
and N. Therefore (8) implies that || f(A)|| < ||T || for every 4 € ¢. Thus f 
is a bounded entire #@(H)-valued function. By Liouville’s theorem 3.32, 
f(A) = f(0) = T, for every 4 € ¢. Hence (9) becomes 


(10) exp (AM*)T = T exp (AN*) (Ae). 

If we equate the coefficients of A in (10), we obtain M*T = TN*. — //// 
Remark. Inspection of this proof shows that it used no properties of 
2(H) which are not shared by every B*-algebra. This observation 


does not lead to a generalization of the theorem, however, because of 
Theorem 12.41. 


Note that the hypotheses of Theorem 12.16 do not imply that 
MT* = T*N, even when M and N are self-adjoint and T is normal: If 


al. 2 mae a oe a 
~ 10 -17 11 07 ~f—-1 1f 


then MT = TN but MT* $ T*N. 


Resolutions of the Identity 


12.17 Definition Let IM be a o-algebra in a set Q, and let H be a 
Hilbert space. In this setting, a resolution of the identity (on Mi) is a mapping 


E: Mt BA) 


with the following properties: 
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(a) E(O)=0, EQ) =I. 

(b) Each E(o) is a self-adjoint projection. 

(c) Ela’ 0 @w") = E(o')E(o”). 

(d) Ifw’ 0 ow” = @, then E(w’ U w”) = E(w’) + E(o”). 

(e) Forevery x e H and ye H, the set function E, , defined by 


E,, (@) = (E(@)x, y) 
is a complex measure on MN. 


When Wt is the o-algebra of all Borel sets on a compact or locally 
compact Hausdorff space, it is customary to add another requirement to 
(e): Each E, , should be a regular Borel measure. (This is automatically 
satisfied on compact metric spaces, for instance. See [23].) 

Here are some immediate consequences of these properties. 

Since each E(q) is a self-adjoint projection, we have 


(1) E,, (@) = (E(@)x, x) = ||E(@)x||?_— (x € A) 
so that each E, . is a positive measure on Mt whose total variation is 
(2) JE, sll = Ex, (Q) = ||xIl*. 


By (c), any two of the projections E(w) commute with each other. 

If a’ © w” = @, (a) and (c) show that the ranges of E(w’) and E(w”) 
are orthogonal to each other (Theorem 12.14). 

By (da), E is finitely additive. The question arises whether E is count- 
ably additive, 1.e., whether the series 


fe. @) 


(3) dX E(o,) 
converges, in the norm topology of @(A), to E(w), whenever w is the union of 
the disjoint sets w, € Mt. Since the norm of any projection is either 0 or at 
least 1, the partial sums of the series (3) cannot form a Cauchy sequence, 
unless all but finitely many of the E(w,) are 0. Thus E is not countably 
additive, except in some trivial situations. 

However, let {@w,} be as above, and fix x e H. Since E(a,)E(a,,) = 0 
when n # m, the vectors E(w,)x and E(q,,)x are orthogonal to each other 
(Theorem 12.14). By (e), 


(4 Y (E(@,)x, y) = (Ex, y) 


for every y € H. It now follows from Theorem 12.6 that 


[48 


(5) E(w,)x = E(@)x. 
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The series (5) converges in the norm topology of H. We summarize the 
result just proved: 


12.18 Proposition If E is a resolution of the identity, and if x € H, then 
aw — E(w)x 


is a countably additive H-valued measure on WM. 


Moreover, sets of measure zero can be handled in the usual way: 


12.19 Proposition Suppose E is a resolution of the identity. If w, € M 
and E(w,) = 0 for n = 1, 2, 3,..., and ifw = |)*, w,, then E(w) = 0. 


PROOF. Since E(w,) = 0, E, ,(w,) = 0 for every x e H. Since E, ,. 1s 
countably additive, it follows that FE, (w)=0. But ||E(@)x||? = 
E,, .(@). Hence E(w) = 0. //// 


12.20 The algebra L“(E) Let E be a resolution of the identity on M, 
as above. Let f be a complex Mt-measurable function on 9. There is a 
countable collection {D;} of open discs which forms a base for the topology 
of ¢. Let V be the union of those D; for which E(f ~*(D;)) = 0. By Proposi- 
tion 12.19, E(f ~*(V)) = 0. Also, V is the largest open subset of € with this 
property. 

The essential range of fis, by definition, the complement of V. It is the 
smallest closed subset of ¢ that contains f(p) for almost all p € Q, that is, 
for all p € Q except those that lie in some set wm € Wt with E(w) = 0. 

We say that f is essentially bounded if its essential range is bounded, 
hence compact. In that case, the largest value of |/|, as 4 runs through the 
essential range of f, is called the essential supremum || f ||, of f. 

Let B be the algebra of all bounded complex Wt-measurable functions 
on Q; with the norm 


fil = sup {| f(p)|: p € Qh, 


One sees easily that B is a Banach algebra and that 


N={feB: |fll. =9 


is an ideal of B which is closed, by Proposition 12.19. Hence B/N is a 
Banach algebra, which we denote (in the usual manner) by L*(E). 

The norm of any coset [f] =f+ N of L*(E) is then equal to || f||,,, 
and its spectrum o([f ]) is the essential range of f As is usually done in 
measure theory, the distinction between f and its equivalence class [ f ] will 
be ignored. 
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Our next concern will be the integration of functions with respect to 
the projection-valued measures described above. The resulting integrals 
| f dE turn out to be not only linear (as all good integrals ought to be) but 
also multiplicative! 


12.21 Theorem If E is a resolution of the identity, as above, then there 
exists an isometric*-isomorphism ‘¥ of the Banach algebra L*(E) onto a closed 
normal subalgebra A of @(H), which is related to E by the formula. 


(1) (P(f)x, y) = | rae, (x, y € H, fe L*(E)). 
This justifies the notation 
(2) P(t) = [7 dE. 
Moreover, 
(3) INM(P)xI1? = [ure dE,,  (xeH,fe L*(E)), 


and an operator Q € @(H) commutes with every E(w) if and only if Q com- 
mutes with every ‘P(f). 


Recall that a normal subalgebra A of @(H) is a commutative one 
which contains T* for every T € A. To say that Y is a *-isomorphism 
means that P is one-to-one, linear, and multiplicative and that 


(4) H(f)=HN)* (fe L*(E)). 


PROOF. To begin with, let {@,, ..., w,} be a partition of 9, with 
w; € Mt, and let s be a simple function, such that s=a; on a. 
Define ‘Y(s) € B(A) by 


(5) ¥(s) = Ya Blo). 
Since each E(q,) is self-adjoint, 
6) ¥(5)* = 5% Blo) = ¥O) 
i=1 
If {a ..., @,,} is another partition of this kind, and if t = B,; on 
w;, then 


W()¥(t) = Yo 6 E@)E(@') = Yo, B, Blo, 0 
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Since st is the simple function that equals a; 8; on w; A @j, it follows 
that 


(7) Y(s) P(t) = P(st). 

An entirely analogous argument shows that 

(8) Pas + Bt) = oP(s) + BY(0). 
If x e H and y € H, (5) leads to 


n 


(9) (H(s)x, y) = ) a(E(@))x, y) = Le E,. yoy = [ s dE, ,. 


By (6) and (7), 
(10) Y(s)*'P(s) = ‘P(S)P(s) = ‘P(5s) = P(|s|?). 
Hence (9) yields 


(11) || ‘¥(s)x||* = (P(s)*(s)x, x) = (H([s]?)x, x) = [ is? dE, x 


so that 
(12) || P(s)x|] < [S|] oll, 


by formula (2) of Section 12.17. On the other hand, if x € A(E(a,), 
then 


(13) V(s)x = a; E(w x = a,x, 

since the projections E(@,;) have mutually orthogonal ranges. If j is 
chosen so that | «;| = ||s||,,, it follows from (12) and (13) that 

(14) I P(s) ll = [Isl 


Now suppose fe L*(E). There is a sequence of simple measur- 
able functions s, that converges to fin the norm of L*(E). By (14), the 
corresponding operators ‘P(s,) form a Cauchy sequence in @(H) which 
is therefore norm-convergent to an operator that we call ‘P(/); it is 
easy to see that ‘¥(f) does not depend on the particular choice of {s,}. 
Obviously (14) leads to 


(15) IMP =f. Ge Lb). 


Now (1) follows from (9) (with s, in place of s), since each E, , 1s 
a finite measure; (2) and (3) follow from (6) and (11); and if bounded 
measurable functions f and g are approximated, in the norm of L”(E), 
by simple measurable functions s and t, we see that (7) and (8) hold 
with f and g in place of s and t. 
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Thus V is an isometric isomorphism of L°(E) into &(A). Since 
[~(E) is complete, its image A = ‘¥W(L*(E)) is closed in @(H), because of 
(15). 

Finally, if Q commutes with every E(w), then Q commutes with 
‘PY(s) whenever s is simple, and therefore the approximation process 
used above shows that Q commutes with every member of A. //// 


It is perhaps worth mentioning that the equality 


(16) I fll = sup 1 If? dE, .: xl < 1 


holds for every f € L*(E), because of (3) and (15). 


The Spectral Theorem 


The principal assertion of the spectral theorem is that every bounded 
normal operator T on a Hilbert space induces (in a canonical way) a 
resolution E of the identity on the Borel subsets of its spectrum o(T) and 
that T can be reconstructed from E by an integral of the type discussed in 
Theorem 12.21. A large part of the theory of normal operators depends on 
this fact. 

It should perhaps be stated explicitly that the spectrum o(T) of an 
operator T € &H) will always refer to the full algebra @(A). In other 
words, A € o(T) if and only if T — AI has no inverse in @(H). Sometimes we 
shall also be concerned with closed subalgebras A of @(H) which have the 
additional property that I e A and T* € A whenever T é A. (Such algebras 
are sometimes called *-algebras.) 

Let A be such an algebra, and suppose that T ¢ A and T~' € &(H). 
Since TT* is self-adjoint, o(TT*) is a compact subset of the real line 
(Theorem 12.15), hence does not separate ¢, and therefore o,(TT*) = 
o(T T*), by the corollary to Theorem 10.18. Since TT* is invertible in BA), 
this equality shows that (TT*)~' € A, and therefore T~’ = T*(TT*)~' is 
also in A. 

Thus T has the same spectrum relative to all closed *-algebras in @(A) 
that contain T. 

Theorem 12.23 will be obtained as a special case of the following 
result, which deals with normal algebras of operators rather than with indi- 
vidual ones. 


12.22. Theorem [If A is a closed normal subalgebra of BH) which con- 
tains the identity operator I and if A is the maximal ideal space of A, then the 
following assertions are true: 
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There exists a unique resolution E of the identity on the Borel subsets of 
A which satisfies 


(1) r= | tae 


for every T € A, where Tis the Gelfand transform of T. 


The inverse of the Gelfand transform (i.e., the map that takes T back to 
T) extends to an isometric *-isomorphism ® of the algebra L*(E) onto a 
closed subalgebra B of (A), B > A, given by 


(2) Of = [ys dE (fe L*(E)). 


Explicitly, ® is linear and multiplicative and satisfies 
(3) Of = (Of)*, IO) =f. (fe L*(B)). 


B is the closure [in the norm topology of @(H)] of the set of all finite 
linear combinations of the projections E(w). 


If wm < A is open and nonempty, then E(w) # 0. 


An operator S € @(H) commutes with every T € A if and only if S com- 
mutes with every projection E(q). 


PROOF. Recall that (1) is an abbreviation for 
(4) (Tx, y)= | tae. (x,y © H, T € A). 
A 


Since @(H) is a B*-algebra (Section 12.9), our given algebra A is 
a commutative B*-algebra. The Gelfand-Naimark theorem 11.18 
asserts therefore that T > Tis an isometric *-isomorphism of A onto 
C(A). 

This leads to an easy proof of the uniqueness of E. Suppose E 
satisfies (4). Since T ranges over all of C(A), the assumed regularity of 
the complex Borel measures E, , shows that each E, , is uniquely 
determined by (4); this follows from the uniqueness assertion that is 
part of the Riesz representation theorem ([23], Th. 6.19). Since, by 
definition, (E(@)x, y) = E, ,(@), each projection E(w) is also uniquely 
determined by (4). 

This uniqueness proof motivates the following proof of the exis- 
tence of E. If x € H and y € H, Theorem 11.18 shows that T— (Tx, y) 
is a bounded linear functional on C(A), of norm <||x|| |lyl|, since 
Tl. = ||T||. The Riesz representation theorem supplies us therefore 
with unique regular complex Borel measures yp, , on A such that 


(5) (Tx, y) = | Pau, (x, ye H, TEA), 
A 
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For fixed T, the left side of (5) is a bounded sesquilinear functional on 
H, hence so is the right side, and it remains so if the continuous func- 
tion Tis replaced by an arbitrary bounded Borel function f. To each 
such f corresponds therefore an operator ®f ce Z(H) (see Theorem 
12.8) such that 


(6) ((®f)x, y) = | fdu,, (x, ye H). 


Comparison of (5) and (6) shows that ®T= T. Thus © is an 
extension of the inverse of the Gelfand transform. 

It is clear that ® is linear. 

Part of the Gelfand-Naimark theorem states that T is self- 
adjoint if and only if Tis real-valued. For such T, 


| Tau. = (Tx, y) = (x, Ty) = (Ty, x) — | Pu, 
A A 


and this implies that pw, , = w,, ,. Hence 


((®f)x, y) = [7 duty, y = | Ff dpy, . = (Pf )y, x) = (x, (fy) 


for all x, y € H, so that 


(7) of = (Of)*. 
Our next objective is the equality 
(8) (fg) = (Bf )(Dg) 


for bounded Borel functions f, g on A. If Se A and TA, then 
(ST)* = ST: hence 


| SP au.) = (97 9) = | 8 dere, 
A A 


This holds for every § € C(A); hence the two integrals are equal if S is 
replaced by any bounded Borel function f. Thus 


a du, y = [s dutty, y = (@f)Tx, y) = (Tx, 2) = | Tau, 2 


where we put z = (®f)*y. Again, the first and last integrals remain 
equal if Tis replaced by g. This gives 


(D(fg)x, y) = | 1 du, y = [> du, 


= ((®g)x, z) = (®g)x, (®f)*y) = (O(F)P(g)x, y), 
and (8) is proved. 
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We are finally ready to define E: If w is a Borel subset of A, let 
X,, be its characteristic function, and put 


(9) E(w) = (7.0). 


By (8), E(@ A @’) = E(w)E(w’). With w’ = w, this shows that 
each E(w) is a projection. Since ®f is self-adjoint when f 1s real, by (7), 
each E(w) is self-adjoint. It is clear that E(@) = ®(0)=0. That 
E(A) = I follows from (5) and (6). The finite additivity of E is a conse- 
quence of (6), and, for all x, y e H, 


E,, (@) = (E(@)x, y) = | dp, y = Mx, (@). 


Thus (6) becomes (2). That ||®f || = || f||., follows now from Theorem 
12.21. 

This completes the proof of (a) and (b). 

Part (c) is now clear because every fe L*(E) is a uniform limit of 
simple functions (1.e., of functions with only finitely many values). 

Suppose next that w is open and E(w) = 0. If T € A and T has 
its support in @, (1) implies that T = 0; hence T= 0. Since A = C(A). 
Urysohn’s lemma implies now that w = @. This proves (d). 

To prove (e), choose S € AH), x € H, ye H, and put z = S*y. 
For any T € A and any Borel set w < A we then have 


(10) (STx, y) = (Tx, z) = | T dE, ., 
(11) (TSx, y) = | Pas, x 

(12) (SE(@)x, y) = (E(@)x, z) = E,, (@), 
(13) (E(w)Sx, y) = Es,, ,(@). 


If ST = TS for every T e€ A, the measures in (10) and (11) are 
equal, so that SE(w) = E(w)S. The same argument establishes the con- 
verse. This completes the proof. /T// 


We now specialize this theorem to a single operator. 


12.23 Theorem If T € &A) and T is normal, then there exists a unique 
resolution of the identity E on the Borel subsets of o(T) which satisfies 


(1) 


f= | 1 dE(A). 
a(T) 
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Furthermore, every projection E(w) commutes with every S € &(HA) 
which commutes with T. 


We shall refer to this E as the spectral decomposition of T. 
Sometimes it is convenient to think of E as being defined for all Borel 
sets in ¢; to achieve this put E(w) = 0if w A o(T) = ©. 


PROOF. Let A be the smallest closed subalgebra of AH) that contains 
I, T, and T*. Since T is normal, Theorem 12.22 applies to A. By 
Theorem 11.19, the maximal ideal space of A can be identified with 
o(T) in such a way that 7(A) = 4 for every J € o({T). The existence of E 
follows now from Theorem 12.22. 

On the other hand, if E exists so that (1) holds, Theorem 12.21 
shows that 


(2) p(T, T*) = | p(A, 4) dE(A), 
o(T) 

where p is any polynomial in two variables (with complex coefficients). 
By the Stone-Weierstrass theorem, these polynomials are dense in 
C(o(T)). The projections E(w) are therefore uniquely determined by 
the integrals (2), hence by T, just as in the uniqueness proof in 
Theorem 12.22. 

If ST = TS, then also ST* = T*S, by Theorem 12.16; hence S 
commutes with every member of A. By (e) of Theorem 12.22, 
SE(q@) = E(@)S for every Borel set w < o(T). //// 


12.24 The symbolic calculus for normal operators If E is the spectral 
decomposition of a normal operator T € &(H), and if f is a bounded Borel 
function on o(T), it 1s customary to denote the operator 


(1) Pf)= | sdk 
a(T) 


by f(T). 
Using this notation, part of the content of Theorems 12.21 to 12.23 
can be summarized as follows: 


The mapping f > f(T) is a homomorphism of the algebra of all bounded 
Borel functions on o(T) into @ A), which carries the function 1 to I, 
which carries the identity function on o(T) to T, and which satisfies 


(2) f(T) =f(T)* 


and 


(3) I F(T) < sup {| fA): 4 € of T)}. 
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If f € C(o(T)), then equality holds in (3), and therefore f— f(T) is an 
isomorphism on C(o(T)) which satisfies 


| f(T)x||? = | fl? dE,, x. 
a(T) 

If f,, ~f uniformly, then || f(T) — f(T)|| > 0, as n > oo. 

If S € BA) and ST = TS, then Sf(T) = f(T)S for every bounded 
Borel function f. 

Since the identity function can be uniformly approximated, on 
o(T), by simple Borel functions, it follows that T is a limit, in the norm 
topology of BA), of finite linear combinations of projections E(@). 


The following proof contains our first application of this symbolic 


calculus. 


12.25 Theorem If T € &(H) is normal, then 


| T || = sup {|(Tx, x)|: x € H, ||x|| < 1}. 


PROOF. Choose ¢ > 0. It is clearly enough to show that 
(1) |(TX9, Xo)| > TI] —« 


for some x, € H with ||xo|| = 1. 

Since ||T|| = ||T||,, = p(T) (Theorem 11.18), there exists A, € 
o(T) such that |A,| = || T'||. Let w be the set of all 2 € o(T) for which 
|A —Ao| <«. If E is the spectral decomposition of T, then (d) of 
Theorem 12.22 implies that E(w) # 0. Therefore there exists x) € H 

Define f(A) =A-—A, for AEw; put f(A)=0 for all other 
A € o(T). Then 

I(T) = (T = Ap NE(), 
so that 

S(T)x9 = TX9 — Ao Xo. 
Hence 


I(T Xo, Xo) — Ao] =1F(T)xo; Xo) S IFC S ¢, 


since | f(A)| < ¢ for all A € o(T). This implies (1), because | A,)| = || T'|. 
//// 


To see that normality is needed here, let T be the linear operator on 


¢* (with basis {e,, e,}) given by Te, =0, Te, =e. It has ||T|| = 1, but 
|(Tx, x)| < 5 if |x|] < 1. 


Our next result contains a converse to Theorem 12.15. 
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12.26 Theorem = 4A normal T € &(A) is 


(a) self-adjoint if and only if o(T) lies in the real axis, 
(b) unitary if and only if o(T) lies on the unit circle. 


PROOF. Choose A as in the proof of Theorem 12.23. Then T(A) =4 
and (T*)‘(A) = 4 on o(T). Hence T = T* if and only if 4 = A on o(T), 
and TT* = / if and only if AA = 1 on o(T). //// 


12.27 Invariant subspaces As in Theorem 10.35, a closed subspace M 
of H is an invariant subspace of a set ), < @(H) if every T € } maps M into 
M. For example, every eigenspace of T is an invariant subspace of T. When 
dim H < oo, the spectral theorem implies that the eigenspaces of every 
normal operator T span H. [Sketch of proof: The characteristic function of 
each point in o(T) corresponds to a projection in H. The sum of these 
projections is E(o(T)) = I.] If dim H = oo, it can happen that T has no 
eigenvalues (Exercise 20). But normal operators still have invariant sub- 
spaces that are nontrivial (that is, #{0} and #H). 

In fact, let A be a normal algebra, as in Theorem 12.22, and let E be 
its resolution of the identity, on the Borel subsets of A. If A consists of a 
single point, then A consists of the scalar multiples of J, and every subspace 
of H is invariant under A. Suppose that A = w U w’, where w and ow’ are 
nonempty disjoint Borel sets. Let M and M’ be the ranges of E(w) and 
E(@’). Then TE(w) = E(@)T for every T € A. If x € M, it follows that 


Tx = TE(@)x = E(w) Tx, 


so that Tx € M. The same holds for M’. 

Hence M and M' are invariant subspaces of A. 

Moreover, M’ = M+, and H =~M@M. 

Decompositions of A into finitely many (or even countably many) dis- 
joint Borel sets induce, in the same manner, decompositions of H into pair- 
wise orthogonal invariant subspaces of A. 

It is an open problem whether every (nonnormal) T € &(H) has a 
nontrivial invariant subspace if H is an infinite-dimensional separable 
Hilbert space. 


Eigenvalues of Normal Operators 


If T € AHA) is normal, its eigenvalues bear a simple relation to its spectral 
decomposition (Theorem 12.29). This will be derived from the following 
application of the symbolic calculus: 
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12.28 Theorem Suppose T € ZA) is normal and E is its spectral decom- 
position. If f € C(a(T)) and if a, = f ~*(0), then 


(1) M(F(T)) = BE(@o)). 


PROOF. Put g(A) = 1 on @o, g(A) = 0 at all other points of o(T). Then 
fg = 0, so that f(T)g(T) = 0. Since g(T) = E(w ), it follows that 
(2) R(E(@)) < N(f(T)). 


For each positive integer n, let w, be the set of all 4 € o(T) where 
1/n < | f(A)| < 1/f(n — 1). The complement @ of wy, relative to o(T), is 
then the union of the disjoint Borel sets w, . Define 


1/f (A) on @,, 
0 elsewhere on o(T). 


(3) (A) = 


Each f, is a bounded Borel function on o(T), and 


(4) FAT) S(T) = E(o,,) (n = 1, 2, 3, ...). 


If f(T)x = 0, it follows that E(w,)x = 0. The countable additivity 
of the mapping w — E(q@)x (Proposition 12.18) shows therefore that 
E(@)x = 0. But E(@) + E(@,) = I. Hence E(wo)x = x. We have now 
proved that 


(5) N(f(T)) < AE(@o)), 
and (1) follows from (2) and (5). I pif 


12.29 Theorem Suppose E is the spectral decomposition of a normal 
T € AA), Ay € o(T), and E, = E({A,$). Then 


(a) N(T — Aol) = R(Eo), 

(b) A, is an eigenvalue of T if and only if E, # 0, and 

(c) every isolated point of o(T) is an eigenvalue of T. 

(d) Moreover, if o(T) = {A,, A, 43, ...} is a countable set, then every x € H 
has a unique expansion of the form 


where Tx; = 4;,x;. Also, x; 1 x; whenever i ¥j. 


Statements (b) and (c) explain the term point spectrum of T for the set 
of all eigenvalues of T. 


PROOF. Part (a) is an immediate corollary of Theorem 12.28, with 
f(A) =A—A,. It is clear that (b) follows from (a). If 4) is an isolated 
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point of o(T), then {A} is a nonempty open subset of o(T); hence 
E, # 0, by (d) of Theorem 12.22. Therefore (c) follows from (b). 

To prove (d), put E; = E({/,;}), i = 1, 2, 3,.... At limit points A, of 
o(T), E; may or may not be 0. In any case, the projections E; have 
pairwise orthogonal ranges. The countable additivity of w— E(w)x 
(Proposition 12.18) shows that 


y E,x = E(o(T))x = x (x € H). 


t= 


The series converges, in the norm of H. This gives the desired repre- 
sentation of x, if x; = E;x. The uniqueness follows from the orthog- 
onality of the vectors x;, and Tx; = 4; x; follows from (a). //// 


12.30 Theorem 4A normal operator T € AH) is compact if and only if it 
satisfies the following two conditions: 


(a) o(T) has no limit point except possibly 0. 
(b) IfA #0, then dim 1(T — AI) < ~. 


PROOF. For the necessity, see (d) of Theorem 4.18, and Theorem 4.25. 

To prove the sufficiency, assume (a) and (b) hold, let {/;} be 
an enumeration of the nonzero points of o(T) such that |/,|> 
|A,| >---, define f(A) = A1f A = 4; and i < n, and put (4) = 0 at the 
other points of o(T). If E; = E({/;}), as in Theorem 12.29, then 

SAT) — ALE, + aaa + An En: 
Since dim A(E,;) = dim V(T — /A,;1) < 0, each f(T) 1s a compact 
operator. Since | A — f,(A)| < | 4, | for all 4 € o(T), we have 
IT —f(T)|| <|4,) 790 as noo. 


It now follows from (c) of Theorem 4.18 that T 1s compact. [TH 


We have tacitly assumed that o(T) 1s infinite. If o(T) contains only n 
points different from 0, then f(T) = T in the preceding proof, and Theorem 
4.18 is not needed. 


12.31 Theorem Suppose T € &(H) is normal and compact. Then 


(a) T has an eigenvalue 4 with |A| = ||T ||, 
(b) f(T) is compact if f € C(o(T)) and f (0) = 0, but 
(c) f(T) is not compact if f € C(o(T)), f(0) 4 0, and dim H = o. 
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PROOF. Since T is normal, Theorem 11.18 shows that there exists 
A € o(T) with |A| = || T|. If || T || > 0, this A 1s an isolated point of o(T) 
(Theorem 12.30), hence an eigenvalue of T (Theorem 12.29). If 
|| T || = 0, (a) is obvious. 

To prove (b), choose ¢>0, then 6>0 so that | f(A)| <« if 
|2| <6; let A,, ..., Ay be the points in o(T) for which |/;| > 6; find 
polynomials Q, (1 <k <N) so that Q,(A,) = 1, Q,(4,) = 0 for j # k, 
1 <j < N; and define 


P(A) = = FAN A/AY" QA), 


where M 1s a positive integer, so large that | P(A)| < « if |A| < 6. The 
polynomial P has / as a factor. Hence P(T) is a compact operator, by 
(f) of Theorem 4.18. Also, P(A,;) = f(A,;) for 1 <j < N. It follows that 
| P(A) — f(A)| < 2¢ for all A € o(T). Hence || P(T) — f(T)|| < 2, and (c) 
of Theorem 4.18 implies that f(T) is compact. 

In the proof of (c), assume f(0) = 1, without loss of generality. 
Then (b), applied to 1 — f, shows that the operator S = I — f(T) 1s 
compact. Let B be the unit ball of H. Then 


Bc S(B) + f(T)(B). 


If f(T) were compact, it would follow that B lies in the sum of two 
compact sets; hence H would be locally compact, hence finite- 
dimensional, contrary to our hypothesis. //// 


Positive Operators and Square Roots 
12.32 Theorem Suppose T € &H). Then 


(a) (Tx, x) > 0 for every x € H if and only if 
(b) T = T* and o(T) c [0, oo). 


If T € BA) satisfies (a), we call T a positive operator and write T > 0. 
The theorem asserts that this terminology agrees with Definition 11.27. 


PROOF. In general, (Tx, x) and (x, Tx) are complex conjugates of each 
other. But if (a) holds, then (Tx, x) is real, so that 
(x, T*x) = (Tx, x) = (x, Tx) 


for every x € H. By Theorem 12.7, T = T*, and thus o(T) lies in the 
real axis (Theorem 12.15). If A > 0, (a) implies that 


A\|x||? = (Ax, x) < (T + Al)x, x) < |\(T + AD)x|||| xl, 
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so that 
I(T + AD)x|| = Al|x|. 


By Theorem 12.12(c), T + AI is invertible in Z(H), and —A is not in 
o(T). It follows that (a) implies (b). 

Assume now that (b) holds, and let E be the spectral decomposi- 
tion of T, so that 


(Tx, x) = | A dE, (A) (x € H). 
a(T) 


Since each E, , is a positive measure, and since 4 > 0 on o(T), 
we have (Tx, x) > 0. Thus (b) implies (a). ae 


12.33. Theorem Every positive T €¢ B(H) has a unique positive square 
root S € &H). If T is invertible, so is S. 


PROOF. Let A be any closed normal subalgebra of @(H) that contains 
I and T, and let A be the maximal ideal space of A. By Theorem 
11.18, A = C(A). Since T satisfies condition (b) of Theorem 12.32, and 
since o(T) = T{A), we see that T'> 0. Since every nonnegative contin- 
uous function has a unique nonnegative continuous square root, it 
follows that there is a unique S € A that satisfies S? = T and S$ > 0; 
by Theorem 12.32, § > 0 is equivalent to S > 0. 

In particular, let Ag be the smallest of these algebras A. Then 
there exists Sy € A, such that S$ = T and S, > 0. If S e B(H) is any 
positive square root of T, let A be the smallest closed subalgebra of 
Z(H) that contains J and S. Then T e€ A, since T = S*. Hence Ay < A, 
so that Sy € A. The conclusion of the preceding paragraph shows now 


that S = So . 
Finally, if T is invertible, then S~* = T~ +S, since S commutes 
with S? = T. III 


12.34 Theorem If T € &(H), then the positive square root of T*T is the 
only positive operator P € &(A) that satisfies ||Px|| = ||Tx|| for every x € H. 
PROOF. Note first that 
(1) (T*Tx, x) =(Tx, Tx) =||Tx|?>0 (x eH), 


so that T*T > 0. (In the more abstract setting of Theorem 11.28 this 
was much harder to prove!) 
Next, if P € @(H) and P = P*, then 


(2) (P?x, x) = (Px, Px) = ||Px||? (x € H). 
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By Theorem 12.7, it follows that ||Px|| = ||Tx|| for every x e€ H if and 
only if P? = T*T. 
This completes the proof. //// 


The fact that every complex number 2 can be factored in the form 
A = a|A|, where | «| = 1, suggests the problem of trying to factor T € @(A) 
in the form T = UP, with U unitary and P > 0. When this is possible, we 
call UP a polar decomposition of T. 

Note that U, being unitary, is an isometry. Theorem 12.34 shows 
therefore that P is uniquely determined by T. 


12.35 Theorem 

(a) If T € BA) is invertible, then T has a unique polar decomposition 
tf ==0 FP, 

(b) If T € ZA) is normal, then T has a polar decomposition T = UP in 
which U and P commute with each other and with T. 


PROOF. (a) If T 1s invertible, so are T* and T*T, and Theorem 12.33 
shows that the positive square root P of T*T is also invertible. Put 
U = TP’. Then U is invertible, and 


VU SPP a PPP Si 


so that U is unitary. Since P is invertible, it is obvious that TP~' is 
the only possible choice for U. 

(b) Put p(A) = |A|, u(A) = A/|A| if A 40, u(O0) = 1. Then p and u 
are bounded Borel functions on o(T). Put P = p(T), U = u(T). Since 
p>=>0O, Theorem 12.32 shows that P>O. Since uu=1, UU* = 
U*U =I. Since 2 = u(A)p(A), the relation T = UP follows from the 
symbolic calculus. //// 


Remark. It is not true that every T € @(H) has a polar decomposi- 
tion. (See Exercise 19.) However, if P is the positive square root of 
T*T, then || Px|| = || Tx|| for every x e H; hence Tx = Ty if Px = Py, 
by linearity. The formula 


VPx = Tx 


defines a linear isometry V of @(P) onto &(T), which has a contin- 
uous extension to a linear isometry of the closure of @(P) onto the 
closure of &(T). 

If there is a linear isometry of &(P)~ onto &(T)*, then V can be 
extended to a unitary operator on H, and then T has a polar decom- 
position. This always happens when dim H < oo, since @(P) and A(T) 
have then the same codimension. 
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If V is extended to a member of &(H) by defining Vy = 0 for all 
y € AP)’, then V is called a partial isometry. 

Every T € &(H) thus has a factorization T = VP in which P is 
positive and V is a partial isometry. 

In (a), no two of T, U, P need to commute. For example, 


cotis 


In combination with Theorem 12.16, the polar decomposition 
leads to an interesting result concerning similarity of normal oper- 
ators. 


12.36 Theorem Suppose M, N, T € &H), M and N are normal, T is 
invertible, and 


(1) M =TNT“. 
If T = UP is the polar decomposition of T, then 
(2) M = UNU“t. 


Two operators M and N that satisfy (1) are usually called similar. If U 
is unitary and (2) holds, M and WN are said to be unitarily equivalent. The 
theorem thus asserts that similar normal operators are actually unitarily 
equivalent. 


PROOF. By (1), MT = TN. Hence M*T = TN*, by Theorem 12.16. 
Consequently, 
T*M =(M*T)* = (TN*)* = NT*, 
so that 
NP? = NT*T = T*MT = T*TN = P?2N, 


since P? = T*T. Hence N commutes with f(P7”), for every f € C(o(P”)). 
(See Section 12.24.) Since P > 0, o(P?) < [0, oo). If f(A) = 41/2 > 0 on 
a(P7), it follows that NP = PN. Hence (1) yields 


M =(UP)N(UP)~! = UPNP~1U-! = UNU“?, MI) 
The Group of Invertible Operators 


Some features of the group of all invertible elements in a Banach algebra A 
were described at the end of Chapter 10. The following two theorems 
contain further information about this group, in the special case A = A(H). 


12.37 Theorem The group G of all invertible operators T € @(H) is con- 
nected, and every T € G is the product of two exponentials. 
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Here an exponential is, of course, any operator of the form exp (S) 


with S ¢ BH). 


PROOF. Let T = UP be the polar decomposition of some T é G. 
Recall that U is unitary and that P is positive and invertible. Since 
o(P) < (0, oo), log is a continuous real function on o(P). It follows 
from the symbolic calculus that there is a self-adjoint S ¢ @(H) such 
that P = exp (S). Since U is unitary, o(U) lies on the unit circle, so 
that there is a real bounded Borel function f on o(U) that satisfies 


exp {if} =4 [Ae o(U)]. 


(Note that there may not exist any continuous f with this property! ) 
Put OQ = f(U). Then Q € AH) 1s self-adjoint, and U = exp (iQ). Thus 


T = UP = exp (iQ) exp (S). 


From this it follows easily that G is connected, for if T; is defined, for 
O<r<1l,by 


T, = exp (irQ) exp (rS) 


then r — T, is a continuous mapping of the unit interval [0, 1] into G, 
To = I, and T, = T. This completes the proof. //// 


It is now natural to ask whether every T € G is an exponential, rather 


than merely the product of two exponentials. In other words, is every 
product of two exponentials an exponential? The answer is affirmative if 
dim H < oo; in fact, it is affirmative in every finite-dimensional Banach 
algebra, as a consequence of Theorem 10.30. But in general the answer is 
negative, as we shall now see. 


12.38 Theorem Let D be a bounded open set in ¢ such that the set 


(1) 


Q= {ae C: «* € D} 


is connected and such that 0 is not in the closure of D. Let H be the space of 
all holomorphic functions f in D that satisfy 


(2) 


[LPP dm, <0 
D 


(where m, is Lebesgue measure in the plane), with inner product 


(3) 


(f; a= | faam, 


Then H is a Hilbert space. Define the multiplication operator M € &(A) by 


(4) 


(Mf\(z)=2f(z) (fe H, ze D). 


Then M is invertible, but M has no square root in &(A). 
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Since every exponential has roots of all orders, it follows that M is not 
an exponential. 


PROOF. It is clear that (3) defines an inner product that makes H a 
unitary space. We show now that H is complete. Let K be a compact 
subset of D, whose distance from the complement of D if 6. If z € K, 
if A is the open circular disc with radius 6 and center z, and if f(¢) = 
y\ a,(¢ — 2)" for ¢ € A, a simple computation shows that 


i 1 
(5) Ent ytlaPomt?=2 | fp dm. 
=0 ™ Ja 


n 


Since f(z) = dy, it follows that 
(6) If@l<n'7o“|fll @ek, fe H), 


where || f || = (f, f)'/*. Every Cauchy sequence in H converges there- 
fore uniformly on compact subsets of D. From this it follows easily 
that H is complete. Hence H is a Hilbert space. 

Since D is bounded, M € &(HA). Since 1/z is bounded in D, 
M~‘«€ &(H). 

Assume now, to reach a contradiction, that M = Q* for some 
QO € ZH). Fix « € Q. Put A = «?. Then 4 € D. Define 


(7) M,=M —-AI, S=Q- al, T=Q+-+ dal, 
so that 
(8) ST =M,=TS. 


Since we are dealing with holomorphic functions, the formula 


(9) (M,9)2)=(2—Ag(z) (ze D, ge A) 


shows that M, is one-to-one and that its range @(M,) consists of 
exactly those f € H that satisfy f(A) = 0. Hence (6) shows that &(M,) is 
a closed subspace of H, of codimension 1. 

Since M, is one-to-one, the first equation (8) shows that T is 
one-to-one; the second shows that S 1s one-to-one. Since @(M,) # H, 
M, is not invertible in @(H). Hence at least one of S and T 1s not 
invertible. Suppose S 1s not invertible. Since M, = ST, A(M,) < AS), 
so that A&(S) is either @(M,) or H. In the latter case, the open mapping 
theorem would imply that S is invertible. Hence S is a one-to-one 
mapping of H onto @(M,). But the equation M, = ST shows that S 
maps A(T) onto &(M,). Hence @(T) = H, and another application of 
the open mapping theorem shows that T~ + € A(H). 

We have now proved that one and only one of the operators S 
and T is invertible in @(H). Therefore exactly one of the numbers « 
and —a lies in o(Q), if « € . It follows that Q is the union of two 
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disjoint congruent sets, a0(Q) 0 Q and —o(Q) 4 Q, both of which are 
closed (relative to Q) since o(Q) is compact. The assumption that 
M = Q? leads thus to the conclusion that Q is not connected, which 
contradicts the hypothesis. 

This completes the proof. //// 


The simplest example of a region D that satisfies the hypothesis of 
Theorem 12.38 is a circular annulus with center at 0. In that case, a more 
conceptual proof can be given. See Exercise 40. 


A Characterization of B*-algebras 


The fact that every @(H) is a B*-algebra has been exploited throughout this 
chapter. We shall now establish a converse (Theorem 12.41) which asserts 
that every B*-algebra (commutative or not) is isometrically *-isomorphic to 
some closed subalgebra of some @(H). The proof depends on the existence 
of a sufficiently large supply of positive functionals. 


12.39 Theorem If A is a B*-algebra and if z € A, then there exists a 
positive functional F on A such that 


(1) F(e) = 1 and ~~ F(zz*) = ||z||’. 


PROOF. Set zz* = x,. By (e) of Theorem 11.28, a(x.) < [0, 00). Let Ag 
be the maximal ideal space of the closed algebra A, < A generated by 
e and x,. Then A, = C(A,) and (by Theorem 11.19) % is a non- 
negative real continuous function on A,. It attains its maximum at 
some point h € Ay. Thus 


(2) Koh) = IXollo = IlxXoll = I1z1l?. 
Define a linear functional fon Ay by f(x) = x(h). Then 
(3) fe=1, f(ez*)= ||’, 


and || f || = 1, because | f(x)| < ||X!|, = ||x]| for all x € Ag. 
The Hahn-Banach theorem extends f to a linear functional F on 
A, with || F|| = 1. We have to prove that F(yy*) > 0 for every y € A. 
Fix ye A and let A, be the maximal ideal space of the closed 
algebra A, < A generated by e and yy*. Then A, = C(A,). Use F to 
define a linear functional @ on C(A,) by setting 


(4) O(X) = F(x) (x € Aj). 


Then g(1) = Fle) =f(e) = 1, | e&)| < |x| = Ixll., hence gl] = 1, 
and now Lemma 5.26 shows that g(x) > 0 for all x € A for which 
x >0 on A,. If x, = yy*, we see, as at the start of this proof, that 
Xx, => OonA,. Thus F(yy*) = F(x,) = o(%,) = 0, as needed. H/// 
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12.40 Theorem If A is a B*-algebra and if ue A, u #0, there exists a 
Hilbert space H, and there exists a homomorphism T,, of A into @(H,) that 
satisfies T,(e) = I, 


T(x") = T(x)* (x € A), 
| Tx)Il < [lx (& € A), 


and ||T,{u)|| = ||ull. 


PROOF. We regard u as fixed and omit the subscripts u. Fix a positive 
functional F on A that satisfies 


(3) F(e) = 1 and =—s F(u*u) = |\u||?. 
Such an F exists, by Theorem 12.39. Define 


(4) Y = {ye A: F(xy) = 0 for every x € A}. 


Since F is continuous (Theorem 11.31), Y is a closed subspace of A. 
Denote cosets of Y, that is, elements of A/Y, by x’: 


(5) '=x+Y (x € A). 
We claim that 
(6) (a’, b’) = F(b*a) 


defines an inner product on A/Y. 

To see that (a’, b’) is well defined by (6), 1.e., that it 1s independent 
of the choice of representatives a and b, it is enough to show that 
F(b*a) = 0 if at least one of a or b lies in Y. If ae Y, F(b*a) =0 
follows from (4). If b € Y, then 


(7) F(b*a) = F(a*b) = 0, 


by (a) of Theorem 11.31 and another application of (4). Thus (a’, b’) is 
well defined, it is linear in a’, and conjugate-linear in b’, and 


(8) (a’, a’) = F(a*a) = 0, 


since F is a positive functional. If (a’, a’) = 0, then F(a*a) = 0; hence 
F(xa) = 0 for every x € A, by (b) of Theorem 11.31, so that a € Y and 
‘= 0. 
A/Y is thus an inner product space, with norm ||a’|| = F(a*a)*/?. 
Its completion H is the Hilbert space that we are looking for. We 
define linear operators T(x) on A/Y by 


(9) T(x)a’ = (xay. 
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Again, one checks easily that this definition is independent of the 
choice of a € a’, for if y € Y, (4) implies that xy € Y. (Y 1s a left ideal 
in A.) It is obvious that x — T(x) is linear and that 


(10) T(x,)T(x2) = T(x, x2) (x; € A, x2 € A); 

in particular, (9) shows that T(e) is the identity operator on A/Y. We 
now claim that 

(11) ITO) <x ( € A). 

Once this is shown, the uniform continuity of the operators T(x) 


enables us to extend them to bounded linear operators on H. Note 
that 


(12) | T(x)a’||* = (xay’, (xa)’) = F(a*x*xa). 


For fixed a € A, define G(x) = F(a*xa). Then G is a positive functional 
on A, so that 


(13) G(x*x) < G(e)||x\l?, 

by (d) of Theorem 11.31. Thus 

(14) || T(x)a’ ||? = G(x*x) < F(a*a)||x\|? = lla’? |x’, 
which proves (11). 


Next, the computation 
(T(x*)a’, b’) = ((x*ay’, b’) = F(b*x*a) = F((xb)*a) 
= (a’, (xb)’) = (a’, T(x)b*) = (T(x)*a, 5’) 
shows that T(x*)a’ = T(x)*a’, for all a’ € A/Y. Since A/Y is dense in 
H, this proves (1). 
Finally, (3) and (12) show that 
(15) jul? = F(u*u) = || Twe'|? < ITM? 


since |le’||7 = F(e*e) = F(e) = 1. In conjunction with (11), (15) gives 
| T(u)|| = |u|], and the proof 1s complete. //// 


12.41 Theorem [fA is a B*-algebra, there exists an isometric *-isomor- 
phism of A onto a closed subalgebra of @(H), where H is a suitably chosen 
Hilbert space. 


PROOF. Let H be the “direct sum” of the Hilbert spaces H, con- 
structed in Theorem 12.40. Here is a precise description of H: Let 
m,{v) be the H,-coordinate of an element v of the cartesian product of 
the spaces H,. Then, by definition, v € H if and only if 


(1) ¥, [Ix,(0)||? < 0, 
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where ||z,(v)|| denotes the H,-norm of z,(v). The convergence of (1) 
implies that at most countably many z,(v) are different from 0. The 
inner product in H is given by 


(2) (v', v”) =) (20), 2,0") (v, v" € A), 


so that ||v||? = (v, v) is the left side of (1). We leave it as an exercise to 
verify that all Hilbert space axioms are now satisfied by H. 

If S, ¢ A(H,), if ||S,|| <M for all u, and if Sv is defined to be the 
vector whose coordinate in H, 1s 


(3) 1,(Sv) = S,7,(v), 
one verifies easily that Sv ec H if v e H, that S e A(HA), and that 
(4) |S|| = sup ||S,]. 


We now associate with each x € A an operator T(x) € ZH), by 
requiring that 


(5) 1,(T(x)v) = T(x)(7,(v)), 
where T, is as in Theorem 12.40. Since 
(6) ITA) < [lll = ITIL, 
by Theorem 12.40, it follows from (4) that 
(7) | T(x)|| = uP | T(x) |] = I. 
That the mapping x — T(x) of A into @(H) has the other 


required properties follows from a coordinatewise application of 
Theorem 12.40. [// 


An Ergodic Theorem 


12.42 Definitions The term “ergodic” comes from statistical mechan- 
ics, where it is applied to systems in which “time average = space average” 
holds for certain quantities. To see a simple mathematical example, let yz be 
a probability measure on some o-algebra .@ in a set Q, let Wy map Q into Q, 
and define its iterates by wy’ = W, w" = W o w""! (n = 2, 3, 4,...). If we think 
of time as discrete, the “time average” of a function f on Q, relative to the 
transformation y, is 


l 
lim —(f+fopt-+fe yr t) 


n7>o@ 


when this limit exists in some sense. 


The “space average” of an f € L'(u) is simply |g f dy. 
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We will be concerned with measure-preserving one-to-one maps w of Q 
onto Q. This means that W(E) and w~‘(E) are in .@ for every E € .@ and 
that their measure is p(E). It is then clear that 


(2) [rw aus | ray 


for every f € L’(11). 

If, moreover, w(E) = E € .@ occurs only when p(E) = 0 or p(E) = 1, 
then w is said to be ergodic. In that case it is clear that every measurable 
function g for which g > WwW = g ae. [1] 1s constant a.e. [py]. 

We can now state von Neumann’s mean ergodic theorem; it is so 
named because I?-convergence used to be called “convergence in the 
mean.” 


12.43 Theorem Let (Q, .Z, yw) be as above. If wy: Q— Q is one-to-one and 
measure-preserving, and f € I?(1), then the averages 


A, f= (ft fewer t four 


converge, in the I?-metric, to some g € I?(u), as n > 00. 
Moreover, g ° W = g. Thus g is the constant |, f du if is ergodic. 


It is clear that the second assertion follows from the first. The first one 
says, explicitly, that 


lim | 1 ASP du =o. 
2 


n> 


The key to the proof is the observation that the map f— fo w is an 
isometry of I7(u) onto L(y). It is thus a unitary operator on the Hilbert 
space I?(u). The following abstract reformulation of Theorem 12.43 is then 
an easy consequence of the spectral theorem. 


12.44 Theorem I/fU € &(A) is unitary and x € H, then the averages 
1 
(1) Anx == (x + Ux +--+ U"*x) 


converge, in the norm topology of H, to some y € H. 


PROOF. Let E be the spectral decomposition of U. Define functions a, 
and b on the unit circle by 


(2) afd) =— (1+ A+ AY), 


b(1) = 1, b(A) = O for A # 1. 
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Then A, x = a,(U)x. Set y = b(U)x. This gives 


(3) ly — A, x||? = ||b(U)x — a,(U)x||? = | |b —a,|? dE, .. 
a(U) 
Since |b — a,| < 1 on the unit circle, and (b — a,)(A) > 0 pointwise, the 
dominated convergence theorem shows that 
(4) lim lly — A, x|| = 0. //// 


noo 


Exercises 


Throughout these exercises, the letter H denotes a Hilbert space. 

1. The completion of an inner product space is a Hilbert space. Make this state- 
ment more precise, and prove it. (See the proof of Theorem 12.40 for an applica- 
tion.) 

2. Suppose N is a positive integer, « € ¢, «oY = 1, and a? £1. Prove that every 
Hilbert space inner product satisfies the identities 


N 
(x, y)=— > lx + a"yl||/7a" 
n=1 


i oe 


and 


1 {” ; 
(x, y) = > | lx + elle” dé. 
5 jae eae 


Generalize this: Which functions f and measures pw on a set Q give rise to the 
identity 


(x, y) = | IIx + f(p)yll* du(p)? 


3. (a) Assume x, and y, are in the closed unit ball of H, and (x,, y,) > 1 as n— oo. 
Prove that then ||x, — y,|| — 0. 
(b) Assume x, ¢€H, x,—-x weakly, and ||x,|| > ||x||. Prove that then 
IIx, — x|| +0. 
4. Let H* be the dual space of H; define y: H* > H by 


y*(x) = (x, Wy*) = (x € H, y* € H*). 


(See Theorem 12.5.) Prove that H* is a Hilbert space, relative to the inner 
product 


[x*, y*] = (Wy*, wx"). 


If ¢: H** > H* satisfies z**(y*) = [y*, @z**] for all y* e H* and z** e€ H**, 
prove that w@ is an isomorphism of H** onto H whose existence implies that H 
is reflexive. 
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Suppose {u,} is a sequence of unit vectors in H (that is ||u,|| = 1), and assume 
that 


I? = ¥ |(u,, u) P<. 


i#j 


If {a,;} is any sequence of scalars, prove that 


n 
a a; U; 
t=m 


and deduce that the following three properties of {«;} are equivalent to each 
other: 


[e @) 
(a) >, |a;|? < 00. 
i=1 


<(14+D ¥ lm, 


i=m 


(1—T) Di lal? < 


i=m 


(b) > a,u; converges, in the norm of H. 
i=1 

(c) > afu;, y) converges, for every y € H. 
i=1 


This generalizes Theorem 12.6. 


. Suppose E is a resolution of the identity, as in Section 12.17, and prove that 


|E, (a) |? < E,, (@)E,, (@) 


forallxe H, ye H,andwe M. 


7. Suppose U € @(H) is unitary, and ¢ > 0. Prove that scalars a), ..., a, can be 
chosen so that 
|U-* —a oI —a,U —-:-—a,U"|| <e, 
if o(U) is a proper subset of the unit circle, but that this norm is never less than 
1 if o(U) covers the whole circle. 
8. Prove Theorem 12.35 with PU in place of UP. 


11. 


12. 


13. 


14, 


. Suppose T = UP is the polar decomposition of an invertible T € @(H). Prove 


that T is normal if and only if UP = PU. 


. Prove that every normal invertible T € @(H) is the exponential of some normal 


S € BA). 

Suppose N € @(H) is normal, and T € &(H) is invertible. Prove that TNT? is 

normal if and only if N commutes with T*T. 

(a) Suppose S € &@(A), T € BHA), S and T are normal, and ST = TS. Prove that 
S + T and ST are normal. 

(b) If, in addition, S > 0 and T > 0 (see Theorem 12.32), prove that S$ + T >0 
and ST > 0. 

(c) Show, however, that there exist S > 0 and T > 0 such that ST is not even 
normal (of course, then ST # TS). In fact, such examples exist if dim H = 2. 

If T € BA) is normal, show that T* = UT, for some unitary U. When is U 

unique? 

Assume T € &(H) and T*T is a compact operator. Show that T is then 

compact. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 
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Find a noncompact T € @(H) such that T? = 0. Can such an operator be 
normal? 

Suppose T € @(H) is normal, and o(T) is a finite set. Deduce as much informa- 
tion about T from this as you can. 

Show, under the hypotheses of (d) of Theorem 12.29, that the equation Ty = x 
has a solution y é€ H if and only if 


@ 
[Aj 177 I}ee,I? < ©. 
i=1 


(If A; = 0 for one i, then x; must be 0, for this i.) 


The spectrum o(T) of T € @(H) can be divided into three disjoint pieces: 
The point spectrum o,(T) consists of all Ae ¢ for which T — AI is not 
one-to-one. 
The continuous spectrum o,(T) consists of all 2 € ¢ such that T— Alisa 
one-to-one mapping of H onto a dense proper subspace of H. 
The residual spectrum o,(T) consists of all other A € o(T). 
(a) Prove that every normal T € @(H) has empty residual spectrum. 
(b) Prove that the point spectrum of a normal T € &(H) is at most countable, if 
H 1s separable. 
(c) Let Sp and S, be the right and left shifts (as defined in Exercise 2 of Chapter 
10), acting on the Hilbert space /”. 
Prove that (S,)* = S, and that 


o(S1) = 0,(Sq) = (4: |Al <1}, 
o(S,) = o(Sg) = (4: |A] = 1}, 
o(S1) = 0,(Sx) =D. 


Let Sp and S, be as above. Prove that neither S, nor S, has polar decomposi- 
tions UP, with U unitary and P > 0. 

Let u be a positive measure on a measure space Q, let H = L’(u), with the usual 
inner product 


(f, a= | fod 


For ¢ € L(y), define the multiplication operator M, by M,(f) = ¢f. Then 
M, € AA). 

Under what conditions on @ does M, have eigenvalues? Give an example 
in which o(M,) = o,(M,). Show that every M, is normal. What is the relation 
between o(M,) and the essential range of #? Show that ¢ > M, is an isometric 
*-isomorphism of L(y) onto a closed subalgebra A of @(H). (Certain pathologi- 
cal measures py have to be excluded in order to make this last statement correct.) 
Is A a maximal commutative sub-algebra of @(H)? Hint: If T € @A) and 
TM, = M,T for all ¢ € L*(y), and if u(Q) < oo, show that T is a multiplication 
by T(1) and hence that T e€ A. 

Suppose T € @(H) is normal, A is the closed subalgebra of @(H) generated by J, 
T, and T*, and T can be approximated, in the norm topology of @(H), by finite 
linear combinations of projections that belong to A. 
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22. 


23. 


24. 


25. 


26. 


27. 


28. 
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Under what (necessary and sufficient) conditions on o(T) does this 
happen? 
Does every normal T € @(H) have a square root in @(H)? What can you say 
about the cardinality of the set of all square roots of T? Can it happen that two 
square roots of the same T do not commute? Can this happen when T = !? 
Show that the Fourier transform f—f is a unitary operator on 12(R"). What is 
its spectrum? Suggestion: When n = 1, compute the Fourier transforms of 


D3 \f:d\" ‘ Z 
exp (5\(2) exp (—x*) (m = 0, 1, 2, ...). 


Show that any two infinite-dimensional separable Hilbert spaces are isometri- 
cally isomorphic (via countable orthonormal bases; see [23]). Show that the 
space H in Theorem 12.38 is separable. Show that the answer to the question 
that precedes Theorem 12.38 is therefore negative for every infinite-dimensional 
H, separable or not. 

Suppose T € @(H) is normal, f is a bounded Borel function on o(T), and 
S = f(T). If E; and E, are the spectral decompositions of T and S, respectively, 
prove that 


Ew) = E7(f~ "(e)) 


for every Borel set w < o(S). 
If S € BH) and T € AA), the notation S > T means that S — T > 0, that is, 
that 


(Sx, x) > (Tx, x) 


for all x € H. Prove the equivalence of the following four properties of a pair of 
self-adjoint projections P and Q: 

(a) P>Q. 

(b) AP) > AQ). 

(c) PQ=Q. 

(dq) QP =Q. 


If E is a resolution of the identity, it follows that E(w’) > E(w”) if @’ > w”. 


Suppose * is an involution in a complex algebra A, q is an invertible element of 
A such that g* = q and x* is defined by 
x # = q7 ‘x*q 


for every x € A. Show that * is an involution in A. 
Let A be the algebra of all complex 4-by-4 matrices. If M = (m,,) € A, let M* be 
the conjugate transpose of M: m* = m,;. Put 


00 0 1 0 0 0 0 00 0 0 

001 0 100 0 0 0 0 0 
QO — S => T = . 

0 1 0 0 00 0 0 00 0 0 

1 0 0 0 00 0 0 00 0 1 


As in Exercise 27, define 


M*=Q"-'M*Q  (Mé A). 


29. 


30. 


31. 


32. 
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(a) Show that S and T are normal, with respect to the involution *, that 
ST = TS, but that ST* #4 T*S. 

(b) Show that S + T is not *-normal. 

(c) Compare ||SS* || with ||S||. 

(d) Compute the spectral radius p(S + S*); show that it is different from 
|S + S* |]. 

(e) Define V = (u,,) € A so that v,. = 024 = i, v3, = 043 = —i, v,; = 0 otherwise. 
Compute o(VV*); it does not lie in [0, 00). 

Part (a) shows that Theorem 12.16 fails for some involutions. Part (b) 
does the same for part (a) of Exercise 12; (c), (d), and (e) show that various parts 
of Theorem 11.28 fail for the involution *. 

Let X be the vector space of all trigonometric polynomials on the real line: 
these are functions of the form 


f(t) = c,e" ooo ceo", 


where s, € Rand c, € ¢, for 1 < k <n. Show that 
ae eee 
(f,9)= lim — | F (t)g(t) at 
A>a@ 2A —A 


exists and defines an inner product on X, that 


IF? =GA=leal? +-°> + Leal’, 


and that the completion of X is a nonseparable Hilbert space H. Show that H 
contains all uniform limits of trigonometric polynomials; these are the so-called 
“almost-periodic” functions on R. 

Let H,, be an infinite-dimensional Hilbert space, with its weak topology. Prove 
that the inner product is a separately continuous function on H,, x H,, which is 
not jointly continuous. 

Assume T, € &@(H) for n = 1, 2, 3,..., and 


lim ||T, x|| = 0 


n> 
for every x € H. Does it follow that 


lim || T* x|| =0 
for every x € H? 
Let X be a uniformly convex Banach space. This means, by definition, that the 
assumptions 


Ix <1, [yall <4, x, + yl > 2 


imply that ||x, — y,|| > 0. 
For example, every Hilbert space is uniformly convex. 
(a) Prove that Theorem 12.3 holds in X. 
(b) Assume ||x,|| = 1, Ae X*, ||Al| = 1, and Ax,—1. Prove that {x,} is a 
Cauchy sequence (in the norm-topology of X). Hint: Consider A(x, + <,,)- 
(c) Prove that every A € X* attains its maximum on the closed unit ball of X. 
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(d) Assume that x, — x weakly and ||x,]|| > ||x||. Prove that ||x, — x|| > 0. Hint: 
Reduce to the case ||x,|| = 1. Consider A(x, + x), for a suitable A. 

(c) Show that the preceding four properties fail in certain Banach spaces (for 
instance, in L’, or in C). These are therefore not uniformly convex. 

Prove the assertion about the case dim H < oo made in the remark that follows 

Theorem 12.35. 

Find an operator T € &(H), with o(T) = {1}, which is neither unitary nor self- 

adjoint. 

If S is self-adjoint and U = exp (iS), show that U is unitary. Deduce from this, 

and from the fact that o(U) lies on the unit circle, that a(S) lies on the real axis. 

Show that @(T*) = A(T) if T € BA) is normal. Hint: Using Theorem 12.35, 

PS Ttu. 

Define T on H = L7(0, 1) by (Tf)(x) = xf(x). Show that T is self-adjoint and 

that A(T) is a dense proper subspace of H. 

Find a nonnormal T € @(H) such that 


|| T || = sup {|(Tx, x)]: x eH, ||x|| < 1}. 


(This shows that Theorem 12.25 has no converse.) 

Show that T and T* can have the same null space without being normal. 

Let D be a circular annulus in ¢ with center at 0. Define H and M e€ &(H) as in 
Theorem 12.38. Prove that M has no square root in @(H) by completing the 
following outline: Assume Q € @(H), Q? = M. Put u(z) = 1, v(z) =z, h= Qu. 
Since QM = MQ, induction shows that Qu" = hv" for all integers n. It follows 
from the Laurent series expansion that Qf = hf for every fe H. This leads to 
h? = v, ie, h*(z) = z for all z € D, an impossibility. 

Find the adjoint M* of M. (Use Laurent series.) 


CHAPTER 


UNBOUNDED 
OPERATORS 


Introduction 


13.1. Definitions Let H be a Hilbert space. By an operator in H we shall 
now mean a linear mapping T whose domain A(T) is a subspace of H and 
whose range A(T) lies in H. 

It is not assumed that T is bounded or continuous. Of course, if T is 
continuous [relative to the norm topology that A(T) inherits from H] then 
T has a continuous extension to the closure of A(T), hence to H, since 
G(T) is complemented in H. In that case, T is the restriction to A(T) of 
some member of #(H). 

The graph &T) of an operator T in H is the subspace of H x H that 
consists of the ordered pairs {x, Tx}, where x ranges over Z(T). Obviously, 
S is an extension of T [that is, 2(T) < AS) and Sx = Tx for x € A(T)] if 
and only if &(T) c G&S). This inclusion will often be written in the simpler 
form 


(1) Ics, 


A closed operator in H is one whose graph is a closed subspace of 
H x H. By the closed graph theorem, T € @(H) if and only if AT)=H 
and T is closed. 

We wish to associate a Hilbert space adjoint T* to T. Its domain 
G(T*) is to consist of all y € H for which the linear functional 


(2) x + (Tx, y) 
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is continuous on AT). If ye D(T*), then the Hahn-Banach theorem 
extends the functional (2) to a continuous linear functional on H, and there- 
fore there exists an element T*y e€ H that satisfies 


(3) (Tx, y)=(x, T*y) [x € &(T)]. 


Obviously, T*y will be uniquely determined by (3) if and only if AT) is 
dense in H, that is, if and only if T is densely defined. The only operators T 
that will be given an adjoint T* are therefore the densely defined ones. 
Routine verifications show then that T* is also an operator in H, that is, 
that (T*) is a subspace of H and that T* is linear. 

Note that if T € @(A), then the definition of T* given here coincides 
with that given in Section 12.9. In particular, 9(T*) = H and T* € &(A). 

Ordinary algebraic operations with unbounded operators must be 
handled with care; the domains have to be watched. Here are the natural 
definitions for the domains of sums and products: 


(4) WAS + T) = HS) vn AT), 
(5) G(ST) = {x € AT): Tx € HS)}. 
The usual associative laws 
(6) (R+S)+T=R+(S+4 7), (RS)T = R(ST) 


then hold. As regards the distributive laws, one of them, namely, 
(R + S)T = RT + ST, holds in its usual form, but the other one may only 
hold in the form 


(7) T(R + S)>TR+TS, 


since 1t can happen that (R + S)x € A(T), even though one of Rx or Sx is 
not in A(T). Scalar multiplication is defined as follows: If a =0, then 
QY(aT) = H and aT = 0. Ifa £0, then D(aT) = AT) and (aT)x = a(Tx) for 
x € A(T). 


13.2 Theorem Suppose S, T, and ST are densely defined operators in H. 
Then 


(1) TS” Gis) 
If, in addition, S € BA), then 
(2) T*S* = (ST)*. 


Note that (1) asserts that (ST)* is an extension of T*S*. The equality 
(2) implies that T*S* and (ST)* actually have the same domains. 


PROOF. Suppose x € AST) and y € A(T*S*). Then 
(3) (Tx, S*y) = (x, T*S*y), 
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because x € A(T) and S*y € A(T*), and 


(4) (ST x, y) = (Tx, S*y), 
because Tx € AS) and y € A(S*). Hence 

(5) (ST x, y) = (x, T*S*y). 
This proves (1). 


Assume now that S € @(A) and y €e D(ST)*). Then S* € Z(H), 
so that Z(S*) = H, and 


(6) (Tx, S*y) = (STx, y) = (x, (ST)*y) 
for every x € AST). Hence S*y € A(T*), and therefore y € Z(T*S*). 
Now (2) follows from (1). /T// 


13.3. Definition An operator T in H is said to be symmetric if 
(1) (Tx, y) = (x, Ty) 


whenever x € AT) and y ec AT). The densely defined symmetric operators 
are thus exactly those that satisfy 


(2) f le csi fas 


If T = T*, then T is said to be self-adjoint. 

These two properties evidently coincide when T € @(H). In general, 
they do not. 

Moreover, if A(T) is dense and (Tx, y) = (x, Sy) for all x € AT) and 
y € AS), then S c T*. 


13.4 Example Let H = 2? = [7([0, 1]), relative to Lebesgue measure. 
We define operators T,, T,, and T; in L?. Their domains are as follows: 


QY(T,) consists of all absolutely continuous functions f on [0, 1] with 
derivative f’ € L’. 


HT) = AT;) 0 (ff) =f}. 
HT) = HT) 0 {f:f(0) = f(1) = 0}. 
These are dense in I’. Define 
(1) T.f=if'  forfe AT), k =1, 2,3. 
We claim that 
(2) TT, Tee h, TRS T: 
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Since T, < T, < Tj, it follows that T, is a self-adjoint extension of the 
symmetric (but not self-adjoint) operator T, and that the extension T, of T, is 
not symmetric. 

Let us prove (2). Note that 


(3) (T, f, 9) = | (if’\g = | f(ig’) = f, Tn 9) 


when fe A T,), g € A(T,,), and m+ k = 4, since then f(1)g(1) = f(0)g(0). It 
follows that T,, < T¥, or 


(4) one bv T, < T3, T; < Tf. 


Suppose now that g e A(T#) and ¢ = Tg. Put ®(x) = [5 ¢. Then, for 
fe HT), 


(5) | if'9 =(T.f, 9) = (4 &) =f ()@(1) — | f'®. 


When k = 1 or 2, then A(T,) contains nonzero constants, so that (5) implies 
@(1) = 0. When k = 3, then f(1) = 0. It follows, in all cases, that 


(6) ig - De A(T). 


Since &A(T,) = L-, ig =@® if k = 1, and since ®(1) =0 in that case, 
g € &T;). Thus T* c Tj. 
If k = 2 or 3, then &(T,) consists of all u € L’ such that [g u = 0. Thus 


(7) RAT,) = ATs) = Y°, 


where Y is the one-dimensional subspace of I? that contains the constants. 
Hence (6) implies that ig — ® is constant. Thus g is absolutely continuous 
and g’ € L’, that is, g € D(T,). Thus T* c Th. 

If k=2, then ®(1)=0, hence g(0)= (1), and ge AT,). Thus 
TZ ek: 

This completes the proof. 

Before we turn to a more detailed study of the relations between sym- 
metric operators and self-adjoint ones, we insert another example. 


13.55 Example Let H=LT, as in Example 13.4, define Df=f' for 
f¢€ HT,), say (the exact domain is now not very important), and define 
(Mf \t) = tf(t). Then (DM — MD) f =f, or 


(1) DM — MD =I, 


where J denotes the identity operator on the domain of D. 
The identity operator appears thus as a commutator of two operators, 
of which only one is bounded. The question whether the identity is the 
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commutator of two bounded operators on H arose in quantum mechanics. 
The answer is negative, not just in @(H), but in every Banach algebra. 


13.6 Theorem [fA is a Banach algebra with unit element e, if x € A and 
y € A, then 


Xy — yx Fe. 
The following proof, due to Wielandt, does not even use the complete- 
ness of A. 
PROOF. Assume xy — yx = e. Make the induction hypothesis 
(1) x"y — yx" = nx"! £0, 


which is assumed to hold for n= 1. If (1) holds for some positive 
integer n, then x" # 0 and 


n+1 n+1 


x" hy — yx" * = x"™xy — yx) + (x"y — yx")x 

= x"e + nx""!x =(n + 1)x", 

so that (1) holds with n + 1 in place of n. It follows that 
n||x"~* || = ||x"y — yx" || < Ix" yl] < 21x" * Wily, 


or n<2\||x||||y\l, for every positive integer n. This is obviously im- 
possible. //// 


Graphs and Symmetric Operators 


13.7 Graphs_ If H is a Hilbert space, then H x H can be made into a 
Hilbert space by defining the inner product of two elements {a, b} and 
{c,d} of H x H to be 


(1) ({a, b}, {C, d}) = (a, c) + (6, d), 


where (a, c) denotes the inner product in H. We leave it as an exercise to 
verify that this satisfies all the properties listed in Section 12.1. In particular, 
the norm in H x H is given by 


(2) {a,b}? = llall* + |bII?. 
Define 
(3) Vi{a,b$}={-b,a$ (aeH,be H). 
Then V is a unitary operator on H x H, which satisfies V? = —J. Thus 


V?M = M if M is any subspace of H x H. 
This operator yields a remarkable description of T* in terms of T: 
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13.8 Theorem [fT is a densely defined operator in H, then 
(1) GT*) =[VA&T)], 
the orthogonal complement of VG@(T) in H x H. 


Note that once &(T*) is known, so are A(T*) and T*. 


PROOF. Each of the following four statements is clearly equivalent to 
the one that follows and/or precedes it. 


(2) {y, 2} € G(T*). 

(3) (Tx, y) = (x, Zz) for every x € A(T). 

(4) ({ Tx, x}, {y, z}) =0 for every x € A(T). 

(5) {y, z} € [VA(T)}-. [HII 


13.9 Theorem If T is a densely defined operator in H, then T* is a 
closed operator. In particular, self-adjoint operators are closed. 


PROOF. M°* is closed, for every M < H x H. Hence &(T*) is closed in 
H x H, by Theorem 13.8. //// 


13.10 Theorem [fT is a densely defined closed operator in H, then 
(1) H x H=V{T)@ A&T*), 


a direct sum of two orthogonal subspaces. 


PROOF. If YT) is closed, so is V&(T), since V is unitary, and therefore 
Theorem 13.8 implies that VG(T) = [Y(T*)]~; see Theorem 12.4. //// 


Corollary. If a € H and b € H, the system of equations 
—-Tx+y=a 
x+ T*y=b 


has a unique solution with x € B(T) and y € O(T*). 


Our next theorem states some conditions under which a symmetric 
operator is self-joint. 
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13.11 Theorem Suppose T is a densely defined operator in H, and T is 
symmetric. 


(a) 
(b) 


(c) 
(d) 


If 2(T) = H, then T is self-adjoint and T € &(A). 

If T is self-adjoint and one-to-one, then &(T) is dense in H, and T~* is 
self-adjoint. 

If R(T) is dense in H, then T is one-to-one. 

If A(T) = H, then T is self-adjoint, and T~* € BH). 


PROOF. (a) By assumption, T c T*. If D(T) = H, it is thus obvious 
that T = T*. Hence T is closed (Theorem 13.9) and therefore contin- 
uous, by the closed graph theorem. (We could also refer to Theorem 
5.1.) 


(b) Suppose y | &(T). Then x >(Tx, y) =0 is continuous in 
G(T), hence y € D(T*) = A(T), and (x, Ty) =(Tx, y)=0 for all 
x € GT). Thus Ty =0. Since T is assumed to be one-to-one, it 
follows that y = 0. This proves that @(T) is dense in H. 

T~* is therefore densely defined, with 2(T~*')= &(T), and 
(T~ *)* exists. The relations 


(1) GT~') = VG@(—T) and VHT *)=G(—-T) 
are easily verified: 
{a, b} © WT *')< {b, a} © G(T) <= {b, —a} © &—T) 
<> {a,b} e VH&(—T). 


Being self-adjoint, T is closed (Theorem 13.9); hence —T is 
closed, and hence T~* is closed, by (1). Theorem 13.10 can now be 
applied to T~* and to — T and yields the orthogonal decompositions 


(2) Hx H=V4T"')®@G(T*)*) 
and 
(3) Hx H=V&-T)@®H-T)=HT )@VHT~'). 
Consequently, 
(4) G(T ~')*) =[(VAT')}° = W(T~*), 
which shows that (T~')* = T™'. 
(c) Suppose Tx =0. Then (x, Ty) =(Tx, y)=0 for every 
y € &T). Thus x | A&(T), and therefore x = 0. 


(d) Since A(T) =H, (c) implies that T is one-to-one, and 
Q(T ')=H.If x e H and ye H, then x = Tz and y = Tw, for some 
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z € Y(T) and we HT), so that 
(T~ ‘x, y) =(z, Tw) = (Tz, w) = (x, T~ *y). 


Hence T~' is symmetric, (a) implies that T~' is self-adjoint (and 
bounded), and now it follows from (b) that T =(T~')~* is also self- 
adjoint. //// 


13.12 Theorem If T is a densely defined closed operator in H, then 
G(T*) is dense and T** = T. 


PROOF. Since V is unitary, and V* = —I, Theorem 13.10 gives the 
orthogonal decomposition 


(1) Hx H=T)@® V&A T*). 
Suppose z | Y(T*). Then (z, y) = 0 and therefore 
(2) ({0, z}, {—T*y, y}) =0 


for all y € D(T*). Thus {0, z} e [VG(T*)]* = HT), which implies that 
z = T(0) = 0. Consequently, A(T*) is dense in H, and T** is defined. 
Another application of Theorem 13.10 gives therefore 


(3) Hx H=VG(T*)@ &(T**). 

By (1) and (3), 

(4) GT**) =[VA&T*)]" = &(T), 

so that T** = T. //// 


We shall now see that operators of the form T*T have interesting 


properties. In particular, @(T*T) cannot be very small. 


13.13 Theorem Suppose T is a densely defined closed operator in H, and 
Q=14T*T. 


(a) 


(b) 


Under these assumptions, Q is a one-to-one mapping of 
G(Q) = AT*T) = {x € HT): Tx € H(T*)} 


onto H, and there are operators Be &(HA), C € AA) that satisfy 
Bl < 1, |Cl| < 1, C = TB, and 


(1) BU + T*T) C(I + T*T)B =I. 


Also, B > 0, and T*T is self-adjoint. 
If T’ is the restriction of T to O(T*T), then G(T’) is dense in G(T). 
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Here, and in the sequel, the letter J denotes the identity operator with 
domain H. 


PROOF. If x € BQ) then Tx € D(T*), so that 
(2) (x, x) + (Tx, Tx) = (x, x) + (x, T*Tx) = (x, Qx). 


Therefore ||x||? < ||x||||Qx||, which shows that Q is one-to-one. 
By Theorem 13.10 there corresponds to every he H a unique 
vector Bh € OT) and a unique Ch € Y(T*) such that 


(3) {0, h} = {—TBh, Bh} + {Ch, T*Ch}. 


It is clear that B and C are linear operators in H, with domain H. The 
two vectors on the right of (3) are orthogonal to each other (Theorem 
13.10). The definition of the norm in H x H implies therefore that 


(4) |All? > Bhi? + Chil? (he A), 


so that ||B|| < 1 and ||C|| < 1. 
Consideration of the components in (3) shows that C = TB and 
that 


(5) h = Bh + T*Ch = Bh + T*TBh = QBh 


for every he H. Hence QB =I. In particular, B is a one-to-one 
mapping of H onto AQ). If y e BQ), then y = Bh for some he H, 
hence Qy = QBh=h, and BQy = Bh= y. Thus BQ cI, and (1) is 
proved. 

If h e H, then h € Qx for some x € GQ), so that 


(6) (Bh, h) = (BQx, Qx) = (x, Qx) > 0, 


by (2). Thus B > 0, B is self-adjoint (Theorem 12.32), and now (b) of 
Theorem 13.11 shows that Q 1s self-adjoint, hence so is T*T = Q — I. 
This completes the proof of part (a). 
Since T is a closed operator, G(T) is a closed subspace of 
H x H; hence &(T) is a Hilbert space. Assume {z, Tz} € &(T) is 
orthogonal to G(T"). Then, for every x € D(T*T) = FQ), 


O = ({z, Tz}, {x, Tx}) = (z, x) + (Tz, Tx) = (z, x) + (z, T*Tx) 
= (z, Qx). 
But #2(Q) = H. Hence z = 0. This proves (b). //// 


13.14 Definition A symmetric operator T in H is said to be maximally 
symmetric if T has no proper symmetric extension, 1.e., if the assumptions 


(1) Tes. S symmetric 
imply that S = T. 
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13.15 Theorem  Self-adjoint operators are maximally symmetric. 


PROOF. Suppose T is self-adjoint, S is symmetric (that is, S c S*), and 
T <S. This inclusion implies obviously (by the very definition of the 
adjoint) that S* c T*. Hence 


Seo GT * Tes. 
which proves that S = T. //// 


It should be noted that maximally symmetric operators need not be 


self-adjoint; see Example 13.21 and Exercise 10. 


13.16 Theorem If T is a symmetric operator in H (not necessarily 
densely defined), the following statements are true: 


| Tx + ix||* = lx]? + Tx? = [x € AT). 

T is a closed operator if and only if A(T + il) is closed. 

T + il is one-to-one. 

If A(T + il) = H, then T is maximally symmetric. 

The preceding statements are also true if i is replaced by —i. 


PROOF. Statement (a) follows from the identity 
| Tx + ix||? = |]x||? + || Tx]? + (ix, Tx) + (Tx, ix), 
combined with the symmetry of T. By (a), 
(T + il)x  {x, Tx} 


is an isometric one-to-one correspondence between the range of 
T + il and the graph of T. This proves (b). Next, (c) is also an imme- 
diate consequence of (a). If A(T + il) = H and T;, is a proper exten- 
sion of T [that is, A(T) is a proper subset of Z(T,)], then T, + if isa 
proper extension of T + iJ which cannot be one-to-one. By (c), T, is 
not symmetric. This proves (d). 

It is clear that this proof is equally valid with —i in place of i. 


//// 


The Cayley Transform 
13.17 Definition The mapping 


(1) 


t—1 
=z ° 
ba 


t 


sets up a one-to-one correspondence between the real line and the unit 
circle (minus the point 1). The symbolic calculus studied in Chapter 12 
shows therefore that every self-adjoint T ¢ @(H) gives rise to a unitary 
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operator 
(2) U =(T —il(T + il)! 


and that every unitary U whose spectrum does not contain the point 1 is 
obtained in this way. 

This relation T «> U will now be extended to a one-to-one correspon- 
dence between symmetric operators, on the one hand, and isometries, on 
the other. 

Let T be a symmetric operator in H. Theorem 13.16 shows that 


(3) | Tx + ix||? = |x|]? + Tx]? = || Tx — ix]? (x € A(T). 
Hence there is an isometry U, with 

(4) QU) = AT + id), RU) = A(T — il), 

defined by 

(5) U(Tx + ix) = Tx — ix (x € A(T)). 


Since (T + il)~' maps ZU) onto AT), U can also be written in the 
form 
(6) U =(T —il(T + il. 


This operator U is called the Cayley transform of T. Its main features 
are summarized in Theorem 13.19. It will lead to an easy proof of the spec- 
tral theorem for self-adjoint (not necessarily bounded) operators. 


13.18 Lemma _ Suppose U is an operator in H which is an isometry: 
|| Ux|| = ||x|| for every x € BU). 

(a) Ifx € BU) and ye HV), then (Ux, Uy) = (x, y). 

(b) If AU — U) is dense in H, then I — U is one-to-one. 

(c) If any one of the three spaces BU), AU), and GU) is closed, so are the 


other two. 


PROOF. Any of the identities listed in Exercise 2 of Chapter 12 proves 
(a). To prove (b), suppose x € ZU) and (J — U)x = 0, that is, x = Ux. 
Then 


(x, (I = U)y) a7 (x, y) _ (x, Uy) = (Ux, Uy) _ (x, Uy) = 0 


for every ye DU). Thus x 1 AU — UV), so that x = 0 if AU — UV) is 
dense in H. The proof of (c) is a consequence of the relations 


1 
| Ux — Uy|| = |x — yll = We l{x, Ux} — ty, Uy}|), 


which hold for all x, y € B(U). [/// 
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13.19 Theorem Suppose U is the Cayley transform of a symmetric oper- 
ator T in H. Then the following statements are true: 


(a) U is closed if and only if T is closed. 
(b) AI — U) = HT), I — U is one-to-one, and T can be reconstructed from 
U by the formula 
T =i + U\(I—U)"*. 
(The Cayley transforms of distinct symmetric operators are therefore 
distinct.) 
(c) U is unitary if and only if T is self-adjoint. 


Conversely, if V is an operator in H which is an isometry, and if I — V 
is one-to-one, then V is the Cayley transform of a symmetric operator in H. 


PROOF. By Theorem 13.16, T is closed if and only if @(T + il) is 
closed. By Lemma 13.18, U is closed if and only if ZU) is closed. 
Since Z(U) = A(T + il), by the definition of the Cayley transform, (a) 
is proved. 

The one-to-one correspondence x«+z between Q&T) and 
QU) = AT + il), given by 


(1) z= TX + ix, Uz = Tx — ix 
can be rewritten in the form 
(2) (I — U)z = 2ix, (I + U)z = 2Tx. 


This shows that J — U is one-to-one, that AJ — U) = A(T), so that 
(I — U)~* maps A(T) onto AU), and that 


(3) 2Tx=(1+ U)z =(I + U)T — U)"'(2ix) [x € 9(T)]. 


This proves (5). 
Assume now that T is self-adjoint. Then 
(4) Al +T?)=H 
by Theorem 13.13. Since 
(5) (T + iT —i) =1+ T*? =(T —ill(T + il) 
[the three operators (5) have domain Z(T7)], it follows from (4) that 
(6) QU) = AT +il)=H 
and 


(7) RU) = AT — il) = H. 


CHAPTER 13: UNBOUNDED OPERATORS 359 


Since U is an isometry, (6) and (7) imply that U is unitary (Theorem 
12.13). 
To complete the proof of (c), assume that U is unitary. Then 


(8) [AI — U)}" = MI — U) = {0}, 


by (b) and the normality of J — U (Theorem 12.12), so that DT) = 
AI — UV) dense in H. Thus T® is defined, and T c T*. 

Fix ye GT*). Since A(T + il) = BU) = H, there exists yo € 
Q(T) such that 


(9) (T* + il)y =(T + iyo =(T* + Wyo. 


The last equality holds because T c T*. If y,; = y— yo, then y, € 
Y(T*) and, for every x € ZT), 


(10) ((T — il)x, y1) = (x, (T* + iy.) = (x, 0) = 0. 


Thus y, | A(T — il) = AU) = H, and so y, = 0, and y= yo € AT). 
Hence T* c T, and (c) is proved. 
Finally, let V be as in the statement of the converse. Then there 
is a one-to-one correspondence z++x between AV) and AI — V), 
given by 


(11) x=z— Vz. 
Define S on ZS) = AI — V) by 
(12) Sx = i(z + Vz) ixeeze Vz 


If x €e AS) and ye AS), then x = z — Vz and y= u — Vu for some 
ze€ DV) and ue AV). Since V is an isometry, it now follows from (a) 
of Lemma 13.18 that 


(13) (Sx, y) = i(z + Vz, u— Vu) = (Vz, u) — i(z, Vu) 
= (z — Vz, iu + iVu) = (x, Sy). 

Hence S is symmetric. Since (12) can be written in the form 

(14) 2iVz = Sx — ix, 2iz = Sx + ix [ze AV)], 

we see that 

(15) V(Sx + ix) = Sx — ix [x € HS)] 


and that Z(V) = A(S + il). Therefore V is the Cayley transform of S. 
//I/ 
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13.20 The deficiency indices If U, and U, are Cayley transforms of 
symmetric operators T, and T;, it is clear that T, < T, if and only if U, < 
U,. Problems about symmetric extensions of symmetric operators reduce 
therefore to (usually easier) problems about extensions of isometries. 

Let us now consider a closed and densely defined sjmmetric operator 
T in H, with Cayley transform U. Then &(T + il) and &(T — il) are closed 
(see Theorem 13.16), and U is an isometry carrying the first onto the 
second. The dimensions of the orthogonal complements of these two spaces 
are called the deficiency indices of T. (The dimension of a Hilbert space is, by 
definition, the cardinality of any one of its orthonormal bases.) 

Since AT — U) = A(T) is now assumed to be dense in H, every iso- 
metric extension U, of U has AI — U,) dense in H, so that I — U, is 
one-to-one (Lemma 13.18) and U, is the Cayley transform of a symmetric 
extension 7, of T. 

The following three statements are easy consequences of Theorem 
13.19 and the preceding discussion; we still assume that T is closed, sym- 
metric, and densely defined. 


(a) T is self-adjoint if and only if both its deficiency indices are 0. 
(b) T is maximally symmetric if and only if at least one of its deficiency 
indices is 0. 


(c) T has a self-adjoint extension if and only if its two deficiency indices are 
equal. 


The proofs of (a) and (b) are obvious. To see (c), use (c) of Theorem 
13.19 and note that every unitary extension of U must be an isometry of 
[A(T + il)]~ onto [A(T — il)]-. 


13.21 Example Let V be the right shift on 7*. Then V is an isometry 
and I — V is one-to-one (Chapter 12, Exercise 18), and so V is the Cayley 
transform of a symmetric operator T. Since A(V) = ¢” and A(V) has co- 
dimension 1, the deficiency indices of T are 0 and 1. 

This provides us with an example of a densely defined, maximally 
symmetric, closed operator T which is not self-adjoint. 


Resolution of the Identity 


13.22 Notation MM will now be a o-algebra in a set Q, H will be a 
Hilbert space, and E: It — B&(HA) will be a resolution of the identity, with all 
the properties listed in Definition 12.17. Theorem 12.21 describes a sym- 
bolic calculus which associates to every f ¢ L*(E) an operator P(f) € BA), 
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by the formula 


(1) (H(f)x, y) = | fae. (x € H, y € A). 


This will now be extended to unbounded measurable functions f (Theorem 
13.24). We shall use the same notations as in Definition 12.17. 


13.23, Lemma _ Let f: Q— ¢ be measurable. Put 


(1) a,={xeH:| if? dB. <oo}. 


Then @, is a dense subspace of H. If x € H and y € H, then 


1/2 
(2) [isidie.sis ity [irr iE... 


If f is bounded and v = ‘P(f )z, then 


(3) dE, ,=f dE,, (xe H,ze A). 


PROOF. If z= x+y, andweé Mt, then 

|E(o)z||* < (| E(@)x|| + ||E(@)yll)? < 2, E@)x||? + 2, E@)yll? 
or 
(4) E,, (@) < 2E,, (@) + 2E, ,(). 


It follows that Z, is closed under addition. Scalar multiplication 1s 
even easier. Thus 2, is a subspace of H. 

For n = 1, 2, 3,..., let w, be the subset of Q in which | f| < n. If 
x € A(E(o,)) then 


(5) E(@)x = E(w)E(@,)x = E(@ A @,)x 
so that 
(6) E,, (@)=E, (9 @,) (ae Dd, 


and therefore 


(7) a iE,.=| |f |? dE, . <n? ||x||* < . 
Q On 


Thus A(E(w,)) < D,. Since Q = (JP, @,, the countable additivity of 
w — E(w)y implies that y = lim E(,)y for every y € H, so that y lies 
in the closure of ,. Hence Y, is dense. 
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If x e H, y € H, and f is a bounded measurable function on Q, 
the Radon-Nikodym theorem [23] shows that there is a measurable 
function u on Q, with |u| = 1, such that 


(8) uf dE, ,=|f|d|E, y|. 
Hence 
(9) [us d| E, | = (P(uf)x, y) < |P(uf)x || Iyll- 


By Theorem 12.21, 


(10) I'Puf)x||? = {we dE. x -| apes ee 
Q Q 
Now (9) and (10) give (2) for bounded f. The general case follows from 


this. 
Finally (3) holds because 


| dE, , = (P(g)x, v) = (Pg)x, P(F)z) 
= (P(f)P(g)x, 2) = (Y(fg)x, 2) = | @ Ce 


for every bounded measurable g, by Theorem 12.21. /11/ 


13.24 Theorem Let E be a resolution of the identity, on a set Q. 


(a) 


(b) 


(c) 


To every measurable f:Q—+€ corresponds a densely defined closed 
operator ‘P(f) in H, with domain B(Y(f)) = B,, which is characterized 


by 

(1) (P(f)x, n= | fae, (xE€ Dy, ye H) 
and which satisfies 

(2) I(f)xI1* = [ure dE, (x € Dy). 


The multiplication theorem holds in the following form: If f and g are 
measurable, then 


(3) ‘PUP)¥(g)— P(g) = and ——_ G(P(F)¥P(9)) = By 0 Duy. 
Hence ‘¥(f)¥(g) = (fg) if and only if D,, € Da. 
For every measurable f : Q- ¢, 


(4) P(f)* = HS) 
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and 


(5) P(A S)* = PF?) = PUP)*Y(S). 


PROOF. If xe Y, then y> [of dE, is a bounded conjugate-linear 
functional on H, whose norm is at most ({ | f |? dE,, ,)'/*, by (2) of 
Lemma 13.23. It follows that there is a unique element ‘(f)x ¢ H 
that satisfies (1) for every y € H and that 


(6) IM(f)xll? < [ure dE...  (x€QDy). 


The linearity of ¥(f) on D, follows from (1), since E,, , is linear in x. 
Associate with each f its truncations f, = fo,, where @,(p) = 1 if 


| f(p)| <1, $,(p) = 0 if | f(p)| > 0. 


Then Y,_,, = D,, since each f, is bounded, and therefore (6) 
shows, by the dominated convergence theorem, that 


(7) IM — Hel? < | LAP dE,,70 asn-+o, 


for every x € Y,. Since f, is bounded, (2) holds with f, in place of f 
(Theorem 12.21). Hence (7) implies that (2) holds as stated. 

This proves (a), except for the assertion that ‘¥(/) is closed. The 
latter follows from Theorem 13.9 if (4) (to be proved presently) is 
applied to fin place of f. 

We turn to the proof of (5). 

Assume first that f is bounded. Then 9,,c Y,. If ze H and 
v = P(f)z, Equation (3) of Lemma 13.23 and Theorem 12.21 show 
that 


(Y(f)P(g)x, z) = (P(g)x, ¥(F)z) = (¥g)x, v) 
= { g dE, = | oF. , = (V(fg)x, 2). 


Hence 


(8) P(f)¥(g)x = FUg)x (xe B,, fe L*). 
If y = V(g)x, it follows from (8) and (2) that 


(9) [ise dé, = | Jar dE,, (x€QD,,feL*). 


Now let f be arbitrary (possibly unbounded). Since (9) holds for 
all f € L®, it holds for all measurable f. Since DZ(Y(f)¥(g)) consists of 
all x € , such that ye D,, and since (9) shows that y € J, if and 
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only if x € D,,, we see that 
(10) WLS YLG) = Dy 0 Dy. 


If xe BD, D,,, if y= VP(g)z, and if the truncations f, are 
defined as above, then f, >f in L(E, ,), f,g >fg in L-(E,, ,), and now 
(8) (with f, in place of f) and (2) imply 


PPP (g)x = P(f)y = lim PCf,)y = lim PCF, g)x = PCfg)x. 


no no 


This proves (3) and hence (b). 
Suppose now that x e 9, and ye 97 = @,. It follows from (7) 
and Theorem 12.21 that 


(Y(f)x, y) = lim (‘P(f,)x, y) = lim (x, PF) = Ox, YP)y). 


no n> 


Thus y € D(Y(f)*), and 
(11) Y(f) < Pf). 

To pass from (11) to (4) we have to show that every 
z€ YUWY(f)*) lies in D,. Fix z; put v = P(f)*z. Since f, = fo,, the 
multiplication theorem gives 


(12) Pf) = P(F)E(O,)- 


Since ‘¥(@,) is self-adjoint, we conclude from Theorems 13.2 and 12.21 
that 


¥(b,)P(S)* < LY(S)P(b,)]* = PU)* = PCS). 
Hence 
(13) Y(d,vu=V(f)z (n=1, 2, 3,...). 
Since | @,,| < 1, (13) and (2) imply 


(14) a dE, = [144 dE, S E,, (Q) 


for n = 1, 2, 3,.... Hence z € Y,, and (4) is proved. 
Finally, (5) follows from (4) by another application of the multi- 
plication theorem, because 9,7; < D,. //// 


Remark. If g is bounded, then 9,, c Y, (simply because Y, = H) so 
that ‘P(f)¥(g) = Y(fg). This was used in (12). It also shows, for 
bounded g, that 


(15) P(g)¥(f) < PUP) P(9), 
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because ‘Y(g)¥(f) < Vigf) = V(/g). If g is the characteristic function 
of a measurable set w < Q, (15) becomes 


(16) E(w)'P(f) < ‘P(f)E(o). 
Ifx eB, A A(E(o)), it follows that 
(17) E(o)'P(f)x = ‘P(f)E(@)x = P(f)x. 


Thus ‘¥(f) maps 9, 0 A(E(@)) into A(E(@)). 

This should be compared with the discussion of invariant sub- 
spaces in Section 12.27. 

Note also that, by analogy with (3), 


(18) Pf) + P(g) < PF + 9). 


Equality holds if and only if D,,, =D, A Y,, which is true when- 
ever at least one of f, g is bounded. 


13.25 Theorem In the situation of Theorem 13.24, 9, = H if and only if 
fe L*(E). 


PROOF. Assume Y, = H. Since ‘¥(f) is a closed operator, the closed 
graph theorem implies that ‘¥(f) ¢ @(A). If f, = fd, is a truncation of 
f, it follows from the multiplication theorem, combined with Theorem 
12.21, that 


Sala = IPD = IPO PO@DI SPOIL, 


since ||'P(?,)|l = |nllo <1. Thus || fll, < ||P), and fe L°(E). The 
converse is contained in Theorem 12.21. ITH 


13.26 Definition The resolvent set of a linear operator T in H is the set 
of all Ae € such that T — AI is a one-to-one mapping of YT) onto H 
whose inverse belongs to @(A). 

In other words, T — AI should have an inverse S € &(H), which 
satisfies 


S(T — AD <(T — ANS = 1. 


For instance, Theorem 13.13 states that — 1 lies in the resolvent set of 
T*T if T is densely defined and closed. 

The spectrum o(T) of T is the complement of the resolvent set of T, 
just as for bounded operators. 

Some properties of o(T), for unbounded T, are described in Exercises 
17 to 20. 
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For the next theorem, we refer to Section 12.20 for the definition of 
the essential range of a function, with respect to a given resolution of the 
identity. 


13.27. Theorem Suppose E is a resolution of the identity on a set Q, 
f:Q— Cis measurable, and 


O,={peQf(p)=a} (ae). 


(a) If « is in the essential range of f and E(w,) # 0, then ‘P(f) — al is not 
one-to-one. 

(b) If « is in the essential range of f but E(w,) = 0, then ‘¥(f) — al is a 
one-to-one mapping of 2, onto a dense proper subspace of H, and there 
exist vectors x, € H, with ||x,|| = 1, such that 

lim [Y(/)x, — «x, ] = 0. 


n> @ 


(c) o(¥(S)) is the essential range of f. 


In the terminology used earlier for bounded operators, we may say 
that a lies in the point spectrum of ‘¥(f) in case (a) and in the continuous 
spectrum of ‘P(f) in case (b). The conclusion of (b) is sometimes stated by 
saying that « is an approximate eigenvalue of ‘¥(f). 


PROOF. We shall assume, without loss of generality, that « = 0. 

(a) If E(w) # 0, there exists x9 € A(E(w,)) with ||xo|| = 1. Let 
go, be the characteristic function of @,. Then f@,) =0, hence 
Y(f)¥(¢_) = 0, by the multiplication theorem. Since ¥(@_) = E(apo), it 
follows that 


PP )xXo = PP)E(@o)xo = ‘P(F)P(o)X0 = 9. 


(b) The hypothesis is now that E(@,))=0 but E(@,) #0 for 
n= 1, 2,3,..., where 


oy= fre a: isl <=} 


Choose x, € A(E(a,)), ||x,|| = 1; let d, be the characteristic functions 
of w,. The argument used in (a) leads to 


=a |e 


IPP) xnll = IPCFOa)xall S IPOSOa | = WFPallo S 


Thus ¥(f)x, — 0 although ||x, || = 1. 
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If ¥(f)x = 0 for some x € %,, then 


| lf? dE, x = |S) x||? = 0. 


Since | f|>0 ae. LE, .J], we must have E, (Q)=0. But E, (Q) = 
\|x||?. Hence Y(f) is one-to-one. 

Likewise Y(f)* = ‘P(f) is one-to-one. If y 1 A(Y(f)), then 
x — (P(f)x, y) = 0 is continuous in Y,, hence y e D('¥(f)*), and 


(x, P(f)y) =(¥(P)x, =O (XE Dy), 


Therefore, ‘Y(f)y = 0, and y = 0. This proves that A(Y(f)) is dense 
in H. 

Since Y(f) is closed, so is ¥(f)~*. If A(W(/)) filled H, the closed 
graph theorem would imply that ‘¥(f)~* € @(A). But this is impossi- 
ble, in view of the sequence {x,} constructed above. 

Hence (b) is proved. 


(c) It follows from (a) and (b) that the essential range of f is a 
subset of o(‘¥(f)). To obtain the opposite inclusion, assume 0 is not 
in the essential range of f. Then g=1/fe L°(E), fg =1, hence 
Y(f)¥(g) = P(1) = I, which proves that A(‘P(f)) = H. Since | f| > 0, 
‘Y(f) is one-to-one, as in the proof of (b). Therefore ‘¥(f)~ * € @(H), by 
the closed graph theorem. 

This completes the proof. /T// 


The following theorem is sometimes called the change of measure 


principle. 


13.28 Theorem Suppose 


(a) Wt and Wt are o-algebras in sets Q and , 
(b) E: St— BA) is a resolution of the identity, and 
(c) @:Q—>Q has the property that d~ *(w') € M for every w' € M’. 


If E(o’) = E(b~ *(@’)), then E': WM — BHA) is also a resolution of the 


identity, and 


| sae, = [ur b) dE, 


for every St'-measurable f: Q' > € for which either of these integrals exists. 


PROOF. For characteristic functions f, (1) is just the definition of E’. 
Hence (1) holds for simple functions f’ The general case follows from 
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this. The proof that E’ is a resolution of the identity is a matter of 
straightforward verifications and is omitted. //// 


The Spectral Theorem 


13.29 Normal operators A (not necessarily bounded) linear operator 
T in H is said to be normal if T is closed and densely defined and if 


ded ato i Goal 


Every ‘(f) that arises in Theorem 13.24 is normal; this is part of the 
statement of the theorem. We shall now see, just as in the bounded case 
discussed in Chapter 12, that all normal operators can be represented in 
this way, by means of resolutions of the identity on their spectra (Definition 
13.26). For self-adjoint operators, this can be deduced very quickly from the 
unitary case, via the Cayley transform (Theorem 13.30). For normal oper- 
ators in general, a different proof will be given in Theorem 13.33. 


13.30 Theorem To every self-adjoint operator A in H corresponds a 
unique resolution E of the identity, on the Borel subsets of the real line, such 
that 


(1) (Ax, y) = " t dE, ,(t) (x € D(A), y € A). 


Moreover, E is concentrated on o(A) <(—, 0), in the sense that 
E(o(A)) = I. 


As before, this E will be called the spectral decomposition of A. 


PROOF. Let U be the Cayley transform of A, let Q be the unit circle 
with the point 1 removed, and let E’ be the spectral decomposition of 
U (see Theorems 12.23 and 12.26). Since J —U is one-to-one 
(Theorem 13.19), E’({1}) = 0, by (b) of Theorem 12.29, and therefore 


(2) (Ux, 9) = | AdE (4) (xe Hye H). 
Define 
3) fiy=4 2 veg 


and define ‘P(f) as in Theorem 13.24 with E’ in place of E: 


(4) (PCF) x, n= | fae, (xe D,, y € H). 
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Since f is real-valued, ‘P(f) is self-adjoint (Theorem 13.24), and 
since f(A)(1 — A) = i(1 + A), the multiplication theorem gives 


(5) Y(f)7 — U) = iI + UV). 
In particular, (5) implies that AU — U) < Z(¥(f)). By Theorem 13.19, 
(6) AU — U) = iI + UV), 


and GY(A) = ATI — U) < W(Y(f)). Comparison of (5) and (6) shows 
now that ‘(/) is a self-adjoint extension of the self-adjoint operator 
A. By Theorem 13.15, A = ‘¥(f). Thus 


(7) (Ax, y) = | rae.., [x € AA), ye A). 


By (c) of Theorem 13.27, o(A) is the essential range of f: Thus 
o(A) < (— @, o). Note that fis one-to-one in Q. If we define 


(8) E(f(@)) = E(@) 


for every Borel set @ < Q, we obtain the desired resolution E which 
converts (7) to (1). 

Just as (1) was derived from (2) by means of the Cayley trans- 
form, (2) can be derived from (1) by using the inverse of the Cayley 
transform. The uniqueness of the representation (2) (Theorem 12.23) 
leads therefore to the uniqueness of the resolution E that satisfies (1). 

This completes the proof. //// 


The machinery developed in Theorem 13.24 can now be applied to 
self-adjoint operators. The following theorem furnishes an example of this. 


13.31 Theorem Let A be a self-adjoint operator in H. 


(a) (Ax, x) >0 for every x € BA) (briefly: A > 0) if and only if o(A)c 
LO, oo). 
(b) If A > 0, there exists a unique self-adjoint B > 0 such that B? = A. 


PROOF. The proof of (a) is so similar to that of Theorem 12.32 that we 
omit it. 
Assume A > 0, so that a(A) c [0, 00), and 


oO 


(1) (Ax, y) = | tdE, (t) [xe AA), ye A], 


where QA) = {x € H: | t? dE,, ,(t) < oo}; the domain of integration 
is [0, oo). Let s(t) be the nonnegative square root of t > 0, and put 
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B = ‘(s); explicitly, 

(2) (Bx, y) = | s(t) dE, ,(t) (xe, ye A). 
0 


The multiplication theorem (b) of Theorem 13.24, with f=g=s, 
shows that B? = A. Since s is real, B is self-adjoint [(c) of Theorem 
13.24], and since s(t) > 0, (2) with x = y, shows that B > 0. 

To prove uniqueness, suppose C is self-adjoint, C > 0, C* = A, 
and E‘ is its spectral decomposition: 


(3) (eae | “s dE (s) (xe WO, ye H) 
Apply Theorem 13.28 with Q = [0, 00), d(s) = s?, f(t) = t, and 
(4) E'(¢(@)) = E“(@) for w < [0, oo), 

to obtain 


(5) (Ax, y) =(C?x, y) = ['s dE. ,(s) = x dE’. ,(t). 


0 


By (1) and (5), the uniqueness statement in Theorem 13.30 shows that 
E' = E. By (4), E determines E°, and hence C. //// 


The following properties of normal operators will be used in the proof 


of the spectral theorem 13.33. 


13.32 Theorem If N is anormal operator in H, then 


(a) 
(5) 
(c) 


BN) = AN*), 
|| Nx|| = ||N*x|| for every x € DN), and 
N is maximally normal. 


PROOF. If yE DN*N) = D(NN*%), then (Ny, Ny) = (y, N*Ny) because 
Ny € A(N*), and (N*y, N*y) =(y, NN*y) because N*y € ZN) and 
N = N** (Theorem 13.12). Since N*N = NN%*, it follows that 

(1) INyll =||N*yl if y e DIN*N). 


Now pick x € BN). Let N’ be the restriction of N to D(N*N). 
By Theorem 13.13, {x, Nx} lies in the closure of the graph of N’. 
Hence there are vectors y, € Z(N*N) such that 


(2) ly; — x] +O asia 
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and 
(3) || Ny; — Nx|| ~ 0 as i oo. 


By (1), ||N*y; — N*y,|| = ||Ny; — Nyjl], so that (3) implies that {N*y;} 
is a Cauchy sequence in H. Hence there exists z € H such that 


(4) || N*y; — z|| 70 asi o. 


Since N* is a closed operator, (2) and (4) imply that {x, z} € G(N*). 
From this we conclude first that x € D(N*), so that DN) c 

Q(N*), and secondly that 

(5) |N*x|| = |[z|]| = lim ||N*y,|| = lim | Ny,|| = IN|). 


This proves (b) and half of (a). For the other half, note that N* is also 
normal (since N** = N), so that 


(6) QG(N*) < A(N**) = DN). 
Finally, suppose M is normal and N c M. Then M* c N*, so 
that 
(7) Q(M) = D(M*) < A(N*) = BN) c AWM), 
which gives 2(M) = BN); hence M = N. hi) 


13.33. Theorem Every normal operator N in H has a unique spectral 
decomposition E, which satisfies 


(1) (Nx, y) = | 


a(N 


4 dE,, (A) (x € DAN), y € A). 
) 


Moreover, E(@)S = SE(w) for every Borel set to m < a(N) and for every 
S € BA) that commutes with N, in the sense that SN < NS. 


It also follows from (1) and Theorem 13.24 that E(w)N < NE(o). 


PROOF. Our first objective is to find self-adjoint projections P,, with 
pairwise orthogonal ranges, such that P,;N < NP; € @(A), NP; is 
normal, and x =) P;x for every x € H. The spectral theorem for 
bounded normal operators will then be applied to the operators NP,, 
and this will lead to the desired result. 

By Theorem 13.13, there exist B € @(H) and C € &(H) such that 
B > 0, ||B|| < 1, C = NB, and 


(2) BU + N*N) cI =(I + N*N)B. 
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Since N*N = NN%*, (2) implies 
(3) BN = BN(II + N*N)B = BI + N*N)NBCNB=C., 


Consequently, BC = B(NB)=(BN)Bc< CB. Since B and C are 
bounded, it follows that BC = CB and therefore that C commutes 
with every bounded Borel function of B. (See Section 12.24.) 

Choose {t;} so that l=t) >t, >t, >°-::, lim t;=0. Let p; be 
the characteristic function of (t;, t;_,], for i= 1, 2, 3, ..., and put 
f(t) = p{t)/t. Each f, is bounded on o(B) c [0, 1]. Let E® be the spec- 
tral decomposition of B. The equality (2) shows that B is one-to-one, 
that is, 0 is not in the point spectrum of B. Hence E*({0}) = 0, and E? 
is concentrated on (0, 1]. 

Define 


(4) P,=p{B) (i=1,2,3,...). 


Since p,;p; =0 if i#j, the projections P; have mutually orthogonal 
ranges. Since )° p; is the characteristic function of (0, 1], we have 


(5) ; P,x = E20, 1])x =x (xe H). 
Since p(t) = tft), 
(6) NP; = NBf(B) = Cf{B) € BA), 
and P;N = f(B)BN c f,{B)C, by (3), so that 
(7) P,;N < NP,. 

By (6), A(NP;) = H, so that 
(8) AP) < DN) (i = 1, 2, 3,....). 


Hence, if P;x = x, (7) implies P; Nx = NP;x = Nx. Thus N carries 
A(P;) into A(P,), or: A(P;) is an invariant subspace of N. 

Next, we wish to prove that each NP; is normal. By (7) and 
Theorem 13.2, 


(9) (NP;)* < (P;N)* = N*P;,. 

But NP; € BA), so that (NP;)* has domain H. Hence 

(10) (NP,)* = N*P,, 

and now Theorem 13.32 shows, by (8) and (10), that 

(11) INP; x|| = |N*P; xl] = (NP)*x|| (x € A). 


By Theorem 12.12, (11) implies that NP; is normal. 
Hence (5), (6), and (7) show that our first objective has now been 
reached. 
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By Theorem 12.23, each NP; has a spectral decomposition E', 
defined on the Borel subsets of €. 

Since N carries A(P;) into A(P;), P; commutes with NP;. There- 
fore P; commutes with E'(@), for every Borel set w < ¢, so that 


(12) EX@)P,x=P,E(@)xe AP) (xe H,i=1,2,3,..,). 


Since these ranges are pairwise orthogonal, and since (5) implies 


t= 


(13) » WE@)P:xl? < db, Pixll? = xl, 
1 i=1 


the series } E'(w)P; x converges, in the norm of H, and it makes sense 
to define 


(14) Blo) = ¥ E(o)P, 


for all Borel sets w c €. 
It is easy to check that E is a resolution of the identity. Hence 
there is a normal operator M, defined by 


(15) (Mx, y) = [2 dE, (4) (xe DM), ye A), 
where the domain of integration is ¢, and 


(16) Q(M) = . cH: {ar dE, (A) < of 


Our assertion (1) will not be proved by showing that M = N. 
For any x € H, (14) shows that 


(17) E,,o) = IBoyxll? = ¥ NE(@)P,xI2? = Y EL, «(0 


t= 


where x; = P; x. If x € D(N), then P; Nx = NP; x, so that 


(18) DY | Al? dey, A) = ) INP; xill? = > PiNx|l? = Nx’. 
i=1 i i 


i=l i=1 


It follows from (17) and (18) that the integral in (16) is finite for every 
x € Y(N). Hence 


(19) Q(N) < Z(M). 
If x € A(P;), then x = P;x, and so E(w)x = E(w)x; thus E,, , = 


E‘. , for every y e H. Hence 


(Nx, y) =(NP;x, y) = [2 dE‘. (4) = [2 dE, (A) = (Mx, y). 
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Consequently 

(20) P,;Nx = NP,;x = MP;x [x € AN), i = 1, 2, 3, ...]. 
IfQ,;= P,+--- + P;, it follows that OQ; Nx = MQ;x. Thus 
(21) {Q;x, 0;Nx} € GM) [x € AN), i = 1, 2, 3, ...]. 


Since &(M) is closed, it follows from (5) and (21) that {x, Nx} e€ Y(M), 
that is, that Nx = Mx for every x € D(N). Thus N c M, by (19), and 
now the maximality of N (Theorem 13.32) implies N = M. 

This gives the representation (1), with ¢ in place of o(N). That E 
is actually concentrated on o(N) follows from (c) of Theorem 13.27. 

To prove the uniqueness of E, consider the operator 


(22) T = N(I + ./N*N)7}, 


where ./N*N is the unique positive square root of N*N. If (1) holds, 
it follows from Theorem 13.24 that 


(23) T= | @ dE, 


where $(A) = A/(1 + |A|), so that T € @(A), and since @ is one-to-one 
on ¢, Theorem 13.28 implies that the spectral decomposition E? of T 
satisfies 


(24) E(w) = E*($(a)) 


for every Borel set @ < ¢. The uniqueness of E follows now from that 
of E? (Theorem 12.23). 

Finally, assume S € @(H) and SNCNS. Put Q=@Q, = E(o), 
where @ = {A:|A| <n}, and n is some positive integer. Then 
NQ € &(A) is normal and is given by 


(25) NQ= | fae, 


where f(A) = 4 on @, f(A) = 0 outside @. Theorem 13.28 implies that 
the spectral decomposition E’ of NQ satisfies E'(w) = E(f ~*(@)), or 


16) ap =E(ono)=QEo) if0¢a, 
E'({0}) = E({0} u (C — &) = E({0}) +1 -. 


Hence 
(27) E(w) = QE(w) = QE(a) ifoc@. 
By Theorem 13.24, ON < NQ = QNQ, so that 
(28) (QSQ\(NQ) = QSNQ < QNSQ c (NQ\QSQ). 
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Since (QSQ\NQ) € &(A), the inclusions in (28) are actually equalities. 
Now Theorem 12.23 implies that OSQ commutes with every E'(@). 
Consider a bounded w, and take n so large that w < @. By (27) 


QSE(w) = QSQE(w) = E(w)QSQ = E(w)SQ 
so that 
(29) QO, SE(w) = E(@)SQ,, (a Fae as eee 9 
It now follows from Proposition 12.18 that 
(30) SE(w) = E(a)S 


if w is bounded [let n> oo in (29)], and hence also if w is any Borel 


set in . [/// 


Semigroups of Operators 


13.34 Definitions Let X be a Banach space, and suppose that to every 
t € [0, ©) is associated an operator Q(t) € @(X), in such a way that 


Q(9) = I, 
O(s + t) = Q(s)Q(t) for all s > 0 and t > 0, and 
lim ||Q(t)x — x|| = 0 for every x € X. 


tO 


If (a) and (b) hold, {Q(t)} is called a semigroup (or, more precisely, a 


one-parameter semigroup). Such semigroups have exponential representa- 
tions, provided that the mapping t — Q(t) satisfies some continuity assump- 
tion. The one that is chosen here, namely (c), is easy to work with. 


Motivated by the fact that every continuous complex function that 


satisfies f(s + t) = f(s) f(t) has the form f(t) = exp (At), and that fis deter- 
mined by the number A = f’(0), we associate with {Q(t)} the operators A,, 


by 
1 
(1) a= Ow (x € X, € > 0), 
and define 
(2) Ax = limA, x 
e-0 


for all x € HA), that is, for all x for which the limit (2) exists in the norm 
topology of X. 


It is clear that D(A) is a subspace of X and that A is thus a linear 


operator in X. 
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This operator, which is essentially Q’(0), is called the infinitesimal gen- 
erator of the semigroup {Q(t)}. 


13.35 Theorem [If the semigroup {Q(t)} satisfies the preceding hypothe- 
ses, then 


(a) there are constants C, y such that 
|Q()|| < Ce” (O<t<o); 


(b) t— Q(t)x is a continuous map of [0, «) into X, for every x € X; 
(c) QA) is dense in X and A is closed; 
(d) the differential equation 


d 
ap LOX = AQWx = Q(DAx 


holds for every x € D(A); 
(e) for everyx e X, 


Q(t)x = lim (exp (tA,))x, 
2-0 


the convergence being uniform on every compact subset of [0, 00); and 
(f) ifAe Cand Red > y, the integral 


R(A)x = | “e-MO(t)x dt 


defines an operator R(A) € &(X) [the so-called resolvent of {Q(t)}] whose 
range is D(A) and which inverts AI — A. 


It is remarkable that (e) holds for every x € X, not just for x € DA). 
The limit in (e), as well as the one that is implicit in the derivative used in 
(d), is understood to refer to the norm topology of X. It follows from (f) 
that o(A) lies in the half plane {A: Re A < y}. 


PROOF. (a) If there were a sequence t,—0 with ||Q(t,)|| > oo, the 
Banach-Steinhaus theorem would imply the existence of an x € X for 
which {||Q(t,,)x||} is unbounded, contrary to our assumption that 


(1) |O(t)x — x|| ~0 as t > 0. 


Hence there is a 6 > 0 and a C < o such that ||Q(t)|| < C on [0, 6]. If 
now 0<t< oo and n is the positive integer satisfying (n — 1)d < 
t < nd, then ||Q(t/n)|| < C, and the functional equation 


(2) Ols + t) = QAs)Q(t) 
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leads to 


(3) | O(t)|| = |]O(t/n)"|| < C7 < CPT", 
which proves (a), with e? = C1’. 
(b) If0 <s <t < T, then (a) and (2) imply that 


|| O(t)x — Q(s)xl] < ]Q(s)ll - Ol — s)x — x| 
< Ce’ ||O(t — s)x — xll, 


which tends to Oast —s— 0. 
(c) Because of (b), the X-valued integrals 


1 t 
(4) M,x=+ | Gr ds (x e X,t > 0) 
O 
can be defined. In fact, M, € @(X) and ||M,|| < Ce”, by (a). We claim 
that 
(5) A,M,x =A,M,x (ec >0,t>0,x eX). 
To prove (5), insert the integrand Q(s)x ds into 


i { {" | 
€ 0) t 0 


By (2), the left side becomes 


| [Ole + s) — Ofs)]x ds = [Qle) — 1] | O(s)s ds 


= €A,tM,x. 


In the same way, the right side becomes tA, ¢M, x. This gives (5). 

As e— 0, the right side of (5) converges to A,x. Thus M,x e€ 
(A), which proves that D(A) is dense in X, because M,x x as 
t + 0. Moreover, 

(6) AM,x = A,x (x € X). 


To show that A is closed, suppose x, € D(A), x,—>x, and 
Ax, > y. Since Q(s) commutes with Q(t), A, commutes with M,, and 
therefore A commutes with M, on YA). Thus (6) gives 


A,X, = AM,x, = M,Ax,. 
Letting n — oo, we get 
(7) A,x = M,y. 


As t > 0, the right side of (7) converges, to y; hence the same is true of 
A,x. This says that x € D(A) and that Ax = y. The graph of A is 
therefore closed. 
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(d) Multiplying (6) by t gives 


(8) A | ‘Olsyx ds = Q(t)x — x. 


The integrand is continuous. Differentiation of the integral therefore 
proves (d), since Q(t)Ax = AQ(t)x for x € D(A). [Note that Q(t)A, = 
A, Q(t).] 


(e) We need an estimate for the norm of 
t 
exp (tA,) =e" exp (: a0) 


= gnie FPO) 


ae 
nao Nie 


Replace the norm of this sum by the sum of the norms, apply the 
estimate (a), and sum the resulting series, to obtain 


(9) llexp (tA,)|| < C exp : (ef &% — Df < C exp (te”) 
for 0 < e < 1. Now define, for fixed x € X, 

(10) g(s) = {exp (t— 5)A)JQ(s)x  (O<s<?). 

If x € GA), it follows from (d) that 

(11) o'(s) = {exp ((t — s)A,)}Q(s(Ax — A, x). 
Thus (a) and (9) show that there is a K(t) < oo such that 
(12) llo(s)|| < K(@)||Ax — A, x|| 


whenever 0 <s <t,0 <eé < 1, and x € H(A). 
Since g(t) = Q(t)x and @(0) = {exp (tA,)}x, (12) implies 


(13) |O()x — {exp (tA,)}x|] < tK(O)||Ax — A, xl, 


for x € HA), 0 < « < 1. This gives (e) for x € H(A). 

However, ||Q(t) — exp (tA,)|| is bounded onO0 <t<T,O0<e< 1, 
by (a) and (9). These operators form therefore an equicontinuous 
family (Chapter 4, Exercise 3); it follows that their convergence on the 
dense set Y(A) forces their convergence on all of X (Chapter 2, Exer- 
cise 14). This proves (e). 

(f) It follows from (a) that 


& 


14 R(A)|| < C BORA A gp pea eg 
(14) IR@)| | RSs 
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if Re 4 > y. Thus R(A) € @(X). The definition of R(A) shows that 


6A, R(A)x = |e “ot + €)x dt — | “eon dt. 
0 0 


If we replace t by t — «in the first integral, we are led to 


ett 1 


(15) A, R(A)x = R(A)x — - e7 | “e#O(2)x dt. 


é 


As ¢—0, the right side of (15) converges to AR(A)x — x. This shows 
that R(A)x € D(A) and that 


(16) (AI — A)R(A)x = x (x € X). 
On the other hand, if x € D(A), we can apply (d) to 
(17) R(A)A, x = | e *Q(t)A,x dt 


0 


and see that 
- d 
(18) R(A)Ax = | e 4 at O(t)x dt = —x + AR(A)x 
0 


by an integration by parts. Thus 
(19) R(A\(AI — A)x = x (x € G(A)). 


In particular, D(A) lies in the range of R(A). This completes the proof. 
[/f/ 


It is now natural to ask whether the limit can be removed from the 
conclusion (e), that is, under what conditions the exponential representation 
Q(t) = exp (tA) is valid. Theorems 13.36 and 13.38 give answers to these 
questions. 


13.36 Theorem [If {Q(t)} is as in Theorem 13.35, then any of the follow- 
ing three conditions implies the other two: 


(a) BA)=X. 
(b) lim ||Q(e) — I|| = 0. 


(c) Ae @X) and O(t) = e'4 (0<t < 0). 


PROOF. We shall use the same notations as in the proof of Theorem 
13.35. 
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If (a) holds, the Banach-Steinhaus theorem implies that the 
norms of the operators A, are bounded, for all sufficiently small ¢ > 0. 
Since Q(e) — I = €A,, (b) follows from (a). 

If (b) holds, then also ||M, —I|| ~0 as t— 0. Fix t > 0, so small 
that M, is invertible in @(X). Since M, A, = A, M,, we have 


(1) A, =(M,)*A,M,. 


As ¢— 0, (1) shows first of all that A,x converges, for every x € X 
[since M,x->x and (M,)"'A, € B(X)], second that A =(M,)"'A,, 
and third that 


(2) |A,— All < |(M)° AIM, -T] 70 ase0. 


The formula Q(t) = exp (tA) follows now from (e) of Theorem 13.35, 
since (2) implies that 


(3) lim || exp (tA,) — exp (tA)|| = 0 (0<t< o). 
eO 
Thus (c) follows from (b). 
The implication (c) — (a) is trivial. iii) 


Infinitesimal generators have the following characterization. 


13.37 The Hille-Yosida theorem 4A densely defined operator A in a 
Banach space X is the infinitesimal generator of a semigroup {Q(t)} as in 
Definition 13.34 if and only if there are constants C, y so that 


(1) (AL — A)o™ || < C(A— yy ™ 


for all 4 > y and all positive integers m. 


PROOF. If A is related to {Q(t)} as in Theorem 13.35, we saw there that 
(AI — A)~* = R(A), for A > y, where 


(2) R(A)x = | e~“"O(t)x dt 
O 
is the Laplace transform of Q(t)x. Hence R(A)?x is the transform of the 
convolution 
t 
(3) | Q(t — s)Q(s)x ds = tQ(t)x. 
O 


(The formalism is the same as for Fourier transforms.) Continuing in 
this way, we find that 


(4) R(A)"x = aT | “ym te MO(t)x dt 
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for m = 1, 2, 3,.... Therefore, with C and y as in (a) of Theorem 13.35, 
C m-1,-(A-y)t —m 

(5) | RA) || < in (m—D! | t™” “e ™ dt = C(A—y)™. 

This proves the necessity of (1). 

For the converse, set S(e) = (I — eA)~*, so that (1) becomes 

(6) |S(e)"|] < CL — ey) ™ = =(O<&€ <&,m=1, 2, 3,...), 

and the relations 

(7) (I — eA)S(e)x = x = S(e)T — €A)x 


hold, the first for all x e X, the second for all x € D(A). 
If x € GA), then x — S(e)x = —eéS(e)x, so that 


(8) lim S(e)x = x. 

e>0O 
But since ||S(e)|| < C(1 — eg y)~*, {S(e): 0 < € < 9} is equicontinuous, 
and hence (8) holds for all x € X. 


Next we set 
(9) T(t, 6) = exp (tAS(e)) 
and claim that 
(10) | T(t, &)|| < C exp is (0<&<£&,t> 0). 


Indeed, the relation eAS(e) = S(e) — I [see (7)] shows that 


fe @) ¢” 
(11) T(t, e) =e 
m=0 m!é 


Now (10) follows from (6) and (11). 
For x € GA), (7) and (9) show that 


— “(1 €)T(t, 6)” *x} = T(t, e)T(t, 6)” ‘(S(e) — S(6)) Ax. 
If we integrate this and apply T(t, 0) to the result, we obtain 
(12) T(t, e)x — T(t, 6)x = | Tu, é)T(t — u, 0)(S(e) — S(d))Ax du. 


If we use (8) with Ax in place of x, and refer to (10), we see that 
the right side of (12) converges to O when e ~ 0 and 6 > 0. The limit 


(13) O(t)x = lim T(t, €)x 


E> 
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exists therefore for every x € AA), uniformly on every bounded 
subset of [0, 00). Moreover, (10) shows that ||Q(t)|| < Ce”. By equi- 
continuity, and the assumption that F(A) is dense, we see now that 
(13) holds for all x € X. Since T(t, €) is defined by (9), it follows that 
{Q(t)} is a semigroup, as in Definition 13.34. 

Let A be the infinitesimal generator of {Q(t)}. Then, by (f) of 
Theorem 13.35, 


(14) (I — A)~ 1x = | “e*OWx de A>) 


On the other hand, AS(e) is the infinitesimal generator of 
{exp (tAS(e))} = {T(t, e)}. Thus 


(15) (AI — AS(«))~*x = [emne €)x dt. 
By (13) this becomes 
(16) (AI — A)" 'x = | "e*ovox dt. 


Comparison of (14) and (16) shows now that AI — A and AI — A have 
the same inverse for all sufficiently large A, and this implies that 


A=A,. HI 


For our final theorem, we return to the Hilbert space setting. 


13.38 Theorem Assume that {Q(t): 0 < t < oo} is a semigroup of normal 
operators Q(t) € BA), which satisfies the continuity condition 


(1) lim ||Q()x —x||=0 (xe 4H). 


The infinitesimal generator A of {Q(t)} is then a normal operator in H, 
there is ay < 00 such that Re 4 < y for every 4 € o(A), and 


(2) Q(t)=e'4 (0<t< ~). 

If each Q(t) is unitary, then there is a self-adjoint operator S in H such 
that 
(3) Q(t) = e"® (0<t < o). 


This representation of unitary semigroups is a classical theorem of 
M. H. Stone. 

Note: Although @(A) may be a proper subspace of H, the operators 
e'4 are defined in all of H and are bounded. To see this, let E4 be the 
spectral decomposition of A (Theorem 13.33). Since |e'*| < e for all 
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i € o(A), the symbolic calculus described in Theorem 12.21 allows us to 
define bounded operators e'“ by 


(4) eA — | e* dE“(A) (O<t < o). 
a(A) 


The theorem has an easy converse: If A is as in the conclusion, then 
(2) obviously defines a semigroup of normal operators, and (1) holds 
because 


(5) Q(x — x||* = | |e"* — 1|? dE2 (4) + 0 


a(A) 


as t +Q, by the dominated convergence theorem. 


PROOF. Since each Q(s) commutes with each Q(t). Theorem 12.16 
implies that Q(s) and Q(t)* commute. The smallest closed subalgebra 
of @(H) that contains all Q(t) and all Q(t)* is therefore normal. Let A 
be its maximal ideal space, and let E be the corresponding resolution 
of the identity, as in Theorem 12.22. 

Let f, and a, be the Gelfand transforms of Q(t) and A,, respec- 
tively. Then 


(6) a=+— (> 0), 
and a simple computation gives 

E 
(7) a2, —a, = 5) (a,)’, 


since f,, = (f,)*. Define 
(8) b(p) = lim a, -,(p) 


for those p € A at which this limit exists (as a complex number), and 
define b(p) = 0 at all other p € A. Then b is a complex Borel function 
on A. Put B = (b), as in Theorem 13.24, with domain 


(9) mB) = {xe H: | \oP dE. < of 


Then B is a normal operator in H. 

We will show that A = B. 

If x € HA) then ||A, x|| is bounded, as e > 0. Hence there exists 
C,, < oo such that 


(10) [la dE, ,=||A,x|?7<C, (0<e<1) 
A 
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and therefore 
€ 
(11) |42, — 4,| dE,» 5 Cy (0<6< 1), 
A 


by (7). Take ¢ = 2°" (n= 1, 2, 3, ...) in (11) and add the resulting 
inequalities. It follows that 


(12) ), bape = ses | = 0 a.e. [E,. x]. 

n=1 
The limit (8) exists therefore a.e. [E, .], and now Fatou’s lemma and 
(10) imply that 


(13) [ior dE,» S Cy. 
A 


Consequently, Z(A) < DB). 

Part (a) of Theorem 13.35 shows that || exp (A,)|| < y, < 00 for 
0O<e<1, where y, depends on {Q(t)}. Hence |exp a,p)| < y, for 
every peéA, since the Gelfand transform is an isometry on B*- 
algebras. It now follows from (8) that |exp b(p)| < y, for every pe A. 
Hence there exists y < o0 such that 


(14) Re b(p) < y (p € A). 
For every x € QA) and every t > 0, 


(15)  |lexp (tA,)x — exp (tB)x||7 = | ex (ta,) — exp (tb) |? dE,, , 
A 


tends to 0 as e > 0 through the sequence {2~"}, because the integrand 
is bounded by 4y7‘ and its limit is 0 a.e. [E, .]. Hence (e) of Theorem 
13.35 implies that 


(16) Q(t)x = ex [x e HA)]. 


However, e” is a bounded function on A, e e€ @(H), and since 
(16) shows that the continuous operators Q(t) and e” coincide on the 
dense set (A), we conclude that 


(17) Q(t) = e? (0<t< o). 
It follows from (17) that 
eB I 
(18) A.x—Br=(! - Bs 
so that 


eb 


2 


dE, .. 


—b 


(19) |A,x — Bx||? -| . 


A 
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As e— 0, the integrand (19) tends to 0, at every point of A. Since 
|(e? — 1)/z| is bounded on every half-plane {z: Re z <c}, and since 
the integrand (19) can be written in the form 


e® a | Z 
—1| |b|?, 
it follows from (14) and the dominated convergence theorem that 
(20) lim ||A,x — Bx||? =0 if x € DB). 
e>0O 


This proves that Z(B) < D(A) and that A = B. 

That the real part of o(A) is bounded above follows now from 
(14) and (c) of Theorem 13.27. 

This completes the proof, except for the final statement about 
unitary semigroups. If each Q(t) is unitary, then | f,| = 1, (6) shows 
that lim a, is pure imaginary at every point at which it exists, as e — 0, 
hence b(p) is pure imaginary at every p € A, and if S = —iB then (17) 
gives (3), and (c) of Theorem 13.24 shows that S is self-adjoint. //// 


Exercises 


Throughout this set of exercises, the letter H denotes a Hilbert space, unless the 
contrary is stated. 


1. 


The associative law (T,T,)T;, = T,(T, T;) has been used freely throughout this 
chapter. Prove it. Prove also that T, < T, implies ST, < ST, and T,S c T, S. 


. Let T be a densely defined operator in H. Prove that T has a closed extension if 


and only if D(T*) is dense in H. In that case, prove that T** is an extension 
of T. 


. By Theorem 13.8, (T*) = {0} for a densely defined operator T in H if and only 


if G(T) is dense in H x H. Show that this can actually happen. 

Suggestion: Let {e,:n = 1, 2, 3, ...} be an orthonormal basis of H; let 
{x,} be a dense subset of H; define Te, = x,; and extend T linearly to A(T), the 
set of all finite linear combinations of the basis vectors e,. Show that the graph 
of this T is dense in H x H. 


. Suppose T is a densely defined, closed operator in H, and T*T < TT*. Does it 


follow that T is normal? 


. Suppose JT is a densely defined operator in H, and (Tx, x)=O for every 


x € 9(T). Does it follow that Tx = 0 for every x € A(T)? 


. If T is an operator in H, define 


N(T) = {x € HT): Tx = 0}. 
If A(T) is dense, prove that 
N(T*) = AT) A AT*). 
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If T is also closed, prove that 
N(T) = AT*)* O AT). 


This generalizes Theorem 12.10. 


. Consider the following three boundary value problems. The differential equa- 


tion is 
f” —f= J; 


where g € 1?([0, 1]) is given. The choices of boundary conditions are 
(Zi) f(0)=f(l) =0. 

(ii) f'(0) =f") = 0. 

(iii) f(0) =f(1) and f’(0) = f'(1). 

Show that each of these problems has a unique solution f such that f’ is 
absolutely continuous and f” € L([0, 1]). Hint: Combine Example 13.4 with 
Theorem 13.13. 

Do this also by solving the problems explicitly. 


. (a) Prove the self-adjointness of the operator T in L7(R), defined by Tf = if’, 


with YT) consisting of all absolutely continuous f € 7? such that f’ € 2. 
Hint: You may need to know that f(t)-0 as t> +o for every 
f¢ HT). Prove this. Or prove more, namely, that every fe DT) is the 
Fourier transform of an L'-function. 
(b) Fix g € -(R). Use Theorem 13.13 to prove that the equation 


f" -—f=g9 


has a unique absolutely continuous solution fe 2, which has f’e LP, 
f" € Land f’ absolutely continuous. 
Prove also, by direct calculation, that 


1 [* {| 
f(x) = -5{ e’ *g(t) a5 e*~~ ‘g(t) dt. 


— 0 x 


This solution can also be found by means of Fourier transforms. 


. Let H? be the space of all holomorphic functions f(z) = )° c, z" in the open unit 


disc that satisfy 


00 
Fl? = > Leal? < 00. 
n=0 


Show that H? is a Hilbert space which is isomorphic to 7” via the one-to-one 
correspondence f< {c,}. 

Define V € @(H’) by (Vf\(z) = 2f(z). Show that V is the Cayley transform 
of the symmetric operator T in H?, given by 


14+2 
1—z 


(Tf\z) = i F(z). 


Find the ranges of T + il and of T — il; show that one is H? and one has 
codimension 1. (Compare with Example 13.21.) 


10. 


11. 
12. 


13. 
14, 


15. 


16. 


17. 


18. 


19. 
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With H? as in Exercise 9, define V now by 
(Vf)(z) = 2f (27). 


Show that V is an isometry which is the Cayley transform of a closed symmetric 

operator T in H’, whose deficiency indices are 0 and oo. 

Prove part (c) of Lemma 13.18. 

(a) In the context of Theorem 13.24, how are the operators Y(f+ g) and 
Y(f) + P(g) related? 

(b) If f and g are measurable and g is bounded, prove that ‘V(g) maps J, 
into D,. 

(c) Prove that ¥(/) = Y(g) if and only if f= g ae. [E], that is, if and only if 


E({p: f(p) # g(p)}) = 9. 


Is the operator C that occurs in the proof of Theorem 13.33 normal? 
Prove that every normal operator N in H, bounded or not, has a polar decom- 
position 


N = UP = PU, 


where U is unitary, P is self-adjoint, P > 0. Moreover, A(P) = AN). 
Prove the following extension of Theorem 12.16: If T € @(H), if M and N are 
normal operators in H, and if TM c NT, then also TM* c N*T. 
Suppose T is a closed operator in H, A(T) = AT*), and || Tx|| = ||T*x|| for 
every x € Y(T). Prove that T is normal. Hint: Begin by proving that 

(Tx, Ty) =(T*x, T*y) (x € AT), ye AT)). 


Prove that the spectrum o(T) of any operator T in H 1s a closed subset of C. 
(See Definition 13.26.) Hint: If ST < TS = I, and S € @(H), then S(I — AS)~* is 
a bounded inverse of T — M, for small | A]. 

Put ¢(t) = exp (—t?). Define S € AI’), where P = L?(R), by 


(SPO=eOfe-) (fe L), 


so that (S*f)\(t) = d(t)d(t — 1) f(t — 2), etc. (Note that S is presented in its polar 
decomposition S = PU.) 
Find S*. Compute that 


(n — 1)n(n + 1) 


i ee ee 
5 (n ) 


||S"|| = exp } = 


Conclude that S is one-to-one, that @(S) is dense in I’, and that o(S) = {0}. 
Define T, with domain A(T) = A(S), by 


TSf=f (fel). 
Prove that o(T) is empty. 
Let T,, T,, T; be as in Example 13.4, put 


ATs) = {f € HT): f(0) = 9}, 
and define T, f = if’ for all f € D(T%4). 
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PART III: BANACH ALGEBRAS AND SPECTRAL THEORY 


Prove the following assertions: 
(a) Every A € € is in the point spectrum of T,. 
(b) o(T,) consists of the numbers 22n, where n runs through the integers; each of 
these is in the point spectrum of T). 
(c) A(T, — AD has codimension 1 for every 4 € ¢. Hence o(T;) = ¢. The point 
spectrum of 7; is empty. 
(d) o(T,) is empty. 
Hint: Study the differential equation if’ — Af = g. 
This illustrates how sensitive the spectrum of a differential operator is to 
its domain (in this case, to the boundary conditions that are imposed). 
Show that every nonempty closed subset of ¢ is the spectrum of some normal 
operator in H (if dim H = oo). 
Define Q(t) € B72), where ? = L-(R), by 


(QW) f(s) =f(s + 2). 


(a) Prove that each Q(t) is unitary. 

(b) Prove that {Q(t)} satisfies the conditions stated in Definition 13.34. 

(c) If A is the infinitesimal generator of {Q(t)}, prove that fe D(A) if and only if 
{ly f(y)|? dy < oo (where f is the Fourier transform of f) and that Af =f’ 
for all fe DA). 

(d) Prove that o(A) is the imaginary axis. More precisely, show that A — AI is 
one-to-one for every J € ¢, that / lies in the resolvent set of A if and only if 
A is not pure imaginary, and that the range of A — AI is a proper dense 
subspace of I? if A is pure imaginary. 

Hint: g € AA — Al) if and only if g € L and also g(y)/(iy — A) is in LV. 

If f € H? (see Exercise 9) and f(z) = > c,z”, define 


[Q(t) f \(z) = 2 (n+ 1) ‘c, 2" (0<t < o). 


Show that each Q(t) is self-adjoint (and positive). Find the infinitesimal gener- 
ator A of the semigroup {Q(t)}. Is A self-adjoint? Show that A has pure point 
spectrum, at the points log 1, log 3, log 4,.... 

For f € P-(R), x € R,0 < y < ©, define 


00 


| y 
[O) 10x) = - {" Bap LOM 
and put Q(0)f =f. Show that {Q(y): 0 < y < o} satisfies the conditions imposed 
in Definition 13.34 and that ||Q(y)|| = 1 for all y. 

[The integral represents a harmonic function in the upper half-plane, with 
boundary values f, The semigroup property of {Q(y)} can be deduced from this, 
as well as from a look at the Fourier transforms of the functions Q(y) f] 

Find the domain of the infinitesimal generator A of {Q(y)}, and prove that 


where H is the Hilbert transform (Chapter 7, Exercise 24). 
Prove that — A is positive and self-adjoint. 
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24. Show that every isometric operator in H has a closed isometric extension. 
25. On the other hand, show that some symmetric operators in H have no closed 
symmetric extension by completing the following outline. 
Let {e,, e,, 3, -..} be an orthonormal basis of H. Let X be the set of all 
finite sums )| a;e;, subject to the condition )) «; = 0. Prove that X is a dense 
subspace of H. Define U € @(A) by 


uo( 5 xe = Hye, — oe ae;, 
1 2 


and let V be the restriction of U to X. Then V is an isometry, with AV) = xX, 
and I — V is one-to-one on X. Hence V is the Cayley transform of a symmetric 
operator T. Any closed symmetric extension of T would correspond to a closed 
isometric extension V, of V, with I — V, one-to-one. But AV) is dense in H; 
hence V has only one closed isometric extension, namely U, and I — U is not 
one-to-one. 


APPENDIX 


COMPACTNESS 
AND CONTINUITY 


Al Partially ordered sets A set # is said to be partially ordered by a 
binary relation < if: 


(i) a< band b<cimpliesa<c. 
(ii) a< aforeveryae &, 
(iii) a< band b < aimplies a = b. 


A subset 2 of a partially ordered set F is said to be totally ordered if 
every pair a, b € 2 satisfies either a < b or b < a. 
Hausdorff’s maximality theorem states: 


Every nonempty partially ordered set P contains a totally ordered 
subset 2 which is maximal with respect to the property of being totally 
ordered. 


A proof (using the axiom of choice) may be found in [23]. Explicit 
applications of the theorem occur in the proofs of the Hahn-Banach 
theorem, of the Krein-Milman theorem, and of the theorem that every 
proper ideal in a commutative ring with unit lies in a maximal ideal. It will 
now be applied once more (A2) to prepare the way to an easy proof of the 
Tychonoff theorem. 
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A2 Subbases A collection ¥ of open subsets of a topological space X is 
said to be a subbase for the topology t of X if the collection of all finite 
intersections of members of SY forms a base for t. (See Section 1.5.) Any 
subcollection of S whose union is X will be called an Y-cover of X. By 
definition, X is compact provided that every open cover of X has a finite 
subcover. It is enough to verify this property for /-covers: 


Alexander’s subbase theorem. If S is a subbase for the topology of a 
space X, and if every S-cover of X has a finite subcover, then X is 
compact. 


PROOF. Assume X is not compact. We will deduce from this that X 
has an -cover I’ without finite subcover. 

Let P be the collection of all open covers of X that have no 
finite subcover. By assumption, A # @. Partially order A by inclu- 
sion, let Q be a maximal totally ordered subcollection of Y, and let I 
be the union of all members of QO. Then 


(a) I is an open cover of xX, 
(b) YT has no finite subcover, but 
(c) IT u {V} has a finite subcover, for every open V ¢ I. 


Of these, (a) is obvious. Since Q is totally ordered, any finite 
subfamily of I lies in some member of Q, hence cannot cover X; this 
gives (b), and (c) follows from the maximality of Q. 

Put fF =I a &. Since fF cT, (b) implies that I has no finite 
subcover. To complete the proof, we show that I covers X. 

If not, some x € X is not covered by I. By (a), x € W for some 
W eT. Since ¥ is a subbase, there are sets V,,..., V, € Y such that 
x € (| Vc W. Since x is not covered by I, no V, belongs to I’. Hence 
(c) implies that there are sets Y,, ..., Y,, each a finite union of 
members of I, such that X = V, U Y, for 1 <i<n. Hence 


KV re Oe Ve Is YW 


which contradicts (b). //// 


A3 Tychonoff’s theorem If X is the cartesian product of any nonempty 
collection of compact spaces X,,, then X is compact. 


PROOF. If 2,(x) denotes the X,-coordinate of a point x € X, then, by 
definition, the topology of X is the weakest one that makes each 
T,: & — X, continuous; see Section 3.8. Let Y, be the collection of all 
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sets z, ‘(V,), where V, is any open subset of X,. If Y is the union of all 
SF ,, it follows that Y is a subbase for the topology of X. 

Suppose [I is an #-cover of X. Put ©, =I a #,. Assume (to 
get a contradiction) that no [’, covers X. Then there corresponds to 
each « a point x, € X, such that I’, covers no point of the set 2; ‘(x,), 
and if x € X is chosen so that 2,(x) = x,, then x is not covered by I. 
But I" is a cover of X. 

Hence at least one I’, covers X. Since X, is compact, some finite 
subcollection of I, covers X. Since [, < I’, T has a finite subcover, 
and now Alexander’s theorem implies that X is compact. //// 


A4 Theorem If K is a closed subset of a complete metric space X, then 
the following three properties are equivalent: 


(a) K is compact. 
(b) Every infinite subset of K has a limit point in K. 
(c) K is totally bounded. 


Recall that (c) means that K can be covered by finitely many balls of 
radius ¢, for every ¢ > 0. 


PROOF. Assume (a). If E < K is infinite and no point of K is a limit 
point of E, there is an open cover {V,} of K such that each V, contains 
at most one point of E. Therefore {V,} has no finite subcover, a con- 
tradiction. Thus (a) implies (5). 

Assume (b), fix ¢ > 0, and let d be the metric of X. Pick x, € K. 
Suppose x,, ..., X, are chosen in K so that d(x,;,x,)>e if ij. If 
possible, choose x,,, € K so that d(x;, x,4,,)>6é for 1<i<n. This 
process must stop after a finite number of steps, because of (b). The 
é-balls centered at x,,..., x, then cover K. Thus (b) implies (c). 

Assume (c), let P be an open cover of K, and suppose (to reach a 
contradiction) that no finite subcollection of I covers K. By (c), K is a 
union of finitely many closed sets of diameter <1. One of these, say 
K,, cannot be covered by finitely many members of I. Do the same 
with K, in place of K, and continue. The result is a sequence of closed 
sets K; such that 


(i) KOK, >K,2°°"°, 
(ii) diam K, < 1/n, and 
(iii) no K, can be covered by finitely many members of I. 


Choose x, € K,,. By (i) and (ii), {x,} is a Cauchy sequence which 
(since X is complete and each K, is closed) converges to a point 
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xe€ () K,,. Hence x € V for some V eT. By (ii), K, < V when n is 
sufficiently large. This contradicts (iii). Thus (c) implies (a). //// 


Note that the completeness of X was used only in going from (c) to (a). In 
fact, (a) and (b) are equivalent in any metric space. 


A5 Ascoli’s theorem Suppose X is a compact space, C(X) is the sup- 
normed Banach space of all continuous complex functions on X, and 
® < C(X) is pointwise bounded and equicontinuous. More explicitly, 


(a) sup {| f(x)|: fe ®} < oo for every x € X, and 
(b) if ¢>0, every x € X has a neighborhood V such that | f(y) — f(x)| < «é 
for all y € V and for all f € ®. 


Then ® is totally bounded in C(X). 


Corollary. Since C(X) is complete, the closure of ® is compact, and 
every sequence in ® contains a uniformly convergent subsequence. 


PROOF. Fix ¢ > 0. Since X is compact, (b) shows that there are points 
X1,..., X, € X, with neighborhoods V,, ..., V,, such that X = |) V, 
and such that 


(1) | f(x) —f(x;)| <e (fe, xeV,,1<i<n). 


If (a) is applied to x,, ..., x, in place of x, it follows from (1) that ® is 
uniformly bounded: 


(2) sup {| f(x)|:x eX, fe ®} = M < oo. 

Put D = {Ae €: |A| < M}, and associate to each fe ® a point p(f) € 
D" c ¢, by setting 

(3) PF) = (F(%1), ++ £(%n))- 


Since D” is a finite union of sets of diameter <e. there exist f,, ..., 
tm € P such that every p(f) lies within ¢ of some p(f,). 
If f € ®, there exists k, 1 < k < m, such that 


(4) f(x) -—fxdl<e (Usisn). 
Every x € X lies in some V,, and for this i 
(5) If) -f)l<e and — | f(x) —flxi)| < «. 


Thus | f(x) — f,(x)| < 3e for every x € X. 
The 3¢-balls centered at f,, ..., f, therefore cover D. Since ¢ was 
arbitrary, ® is totally bounded. //// 
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A6 Sequential continuity If X and Y are Hausdorff spaces and if 
f maps X into Y, then f is said to be sequentially continuous provided 
that lim,..,, f(x,) =f(x) for every sequence {x,} in X that satisfies 
lim 2 eae a 


noo 


Theorem 


(a) Iff: X > Y is continuous, then f is sequentially continuous. 


(b) Iff: X — Y is sequentially continuous, and if every point of X has a 
countable local base (in particular, if X is metrizable), then f is 
continuous. 


PROOF. (a) Suppose x, > x in X, V is a neighborhood of f(x) in Y, 
and U =f ~'(V). Since f is continuous, U is a neighborhood of x, and 
therefore x, € U for all but finitely many n. For these n, f(x,) € V. 
Thus f(x,) > f(x) as n> oo. 

(b) Fix x € X, let {U,,} be a countable local base for the topol- 
ogy of X at x, and assume that fis not continuous at x. Then there is 
a neighborhood V of f(x) in Y such that f~‘(V) is not a neighbor- 
hood of x. Hence there is a sequence x,, such that x, e U,, x, > x as 
n> oo, and x, ¢f~ ‘(V). Thus f(x,) ¢ V, so that f is not sequentially 
continuous. //// 


A7 Totally disconnected compact spaces A topological space X is 
said to be totally disconnected if none of its connected subsets contains 
more than one point. 

A set E < X is said to be connected if there exists no pair of open sets 
V,, V, such that 


BG VANS Vag EnV, #42, EOvV,#4 2, 
butE NV, nV, = ©. 


Theorem. Suppose K <V <—X, where X is a compact Hausdorff 
space, V is open, and K is a component of X. Then there is a compact 
open set A suchthat K ACV. 


Corollary. If X is a totally disconnected compact Hausdorff space, then 
the compact open subsets of X form a base for its topology. 


PROOF. Let I’ be the collection of all compact open subsets of X that 
contain K. Since X eT, IT # @. Let H be the intersection of all 
members of I. 

Suppose H < W, where W is open. The complements of the 
members of I form an open cover of the compact complement of W. 
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Since [ is closed under finite intersections, it follows that A <c W for 
some A eI. 
We claim that H is connected. To see this, assume H = Hy U 
H,, where H, and H, are disjoint compact sets. Since K ¢ H and K is 
connected, K lies in one of these. Say K < H,. By Urysohn’s lemma, 
there are disjoint open sets W,, W, such that Hp c W, H, < W,, and 
the preceding paragraph shows that some A €T satisfies A c Wo U 
W,. Put Ag=ANW,. Then KcCAj, Ay is open, and Apo is 
compact, because A 1 W, = An W. Thus Ay € I. Since H < Ap, it 
follows that H, = ©. 
Thus H is connected. Since K < H and K is a component, we 
see that K = H. The preceding argument, with K and V in place of H 
and W, shows that A c V forsome 4 eT. //// 
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NOTES AND 
COMMENTS 


The abstract tendency in analysis which developed into what is now known 
as functional analysis began at the turn of the century with the work of 
Volterra, Fredholm, Hilbert, Fréchet, and F. Riesz, to mention only some of 
the principal figures. They studied integral equations, eigenvalue problems, 
orthogonal expansions, and linear operations in general. It is of course no 
accident that the Lebesgue integral was born in the same period. 

The normed space axioms appear in F. Riesz’ work on compact oper- 
ators in C([La, b]) (Acta Math., vol. 41, pp. 71-98, 1918), but the first abstract 
treatment of the subject is in Banach’s 1920 thesis (Fundam. Math., vol. 3, 
pp. 133-181, 1922). His book [2], published in 1932, was tremendously 
influential. It contains what is still the basic theory of Banach spaces, but 
with some omissions which, from our vantage point, seem curious. 

One of these is the complete absence of complex scalars, in spite of 
Wiener’s observation (Fundam. Math., vol. 4, pp. 136-143, 1923) that the 
axioms can be formulated just as well over ¢, and, more importantly, that a 
theory of Banach-space-valued holomorphic functions can then be devel- 
oped whose basic features are very similar to the classical complex-valued 
case. Very little (if anything) was done with this until 1938. (See the notes 
for Chapter 3 in this appendix.) 

Even more puzzling, in retrospect, is Banach’s treatment of weak 
convergence—surely one of his most important contributions to the subject. 
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In spite of the vigorous development of topology in the twenties, and in 
spite of von Neumann’s explicit description of weak neighborhoods in a 
Hilbert space and in operator algebras (Math. Ann., vol. 102, pp. 370-427, 
1930; see p. 379), Banach deals only with weakly convergent sequences. 
Since the adjunction of all limits of weakly convergent subsequences of a set 
need not lead to a weakly sequentially closed set (see Exercise 9, Chapter 3), 
he is forced into complicated notations such as transfinite closures, but he 
never uses the much simpler and more satisfactory concept of weak topol- 
ogies.* 

Occasionally, unnecessary separability assumptions are made in [2]. 
This is also true of von Neumann’s axiomatization of Hilbert space (Math. 
Ann., vol. 102, pp. 49-131, 1930), where separability is included among the 
defining properties. In this fundamental paper on unbounded operators, he 
establishes the spectral theorem for them, thus generalizing what Hilbert 
had done for the bounded ones more than 20 years earlier. Another basic 
contribution to operator theory was M. H. Stone’s 1932 book [28]. 

Although continuous functions obviously play an important role in 
Banach’s book, he considers only their vector space structure. They are 
never multiplied. But multiplication was not neglected for very long. In his 
work on the tauberian theorem (Ann. Math., vol. 33, pp. 1-100, 1932) 
Wiener stated and used the fact that the Banach space of absolutely con- 
vergent Fourier series satisfies the multiplicative inequality ||xy|| < ||x|l|ly|l. 
M.H. Stone’s generalization of the Weierstrass approximation theorem 
(Trans. Amer. Math. Soc., vol. 41, pp. 375-481, 1937; especially pp. 453-481) 
is undoubtedly the best-known instance of the explicit use of the ring struc- 
ture of spaces of continuous functions. Von Neumann’s interest in operator 
theory, whick stemmed from quantum mechanics, led him to a systematic 
study of operator algebras. M. Nagumo (Jap. J. Math., vol. 13, pp. 61-80, 
1936) initiated the abstract study of normed rings. But what really got this 
subject off the ground was Gelfand’s discovery of the important role played 
by the maximal ideals of a commutative algebra (Mat. Sbornik N. S., vol. 9, 
pp. 3-24, 1941) and his construction of what is now known as the Gelfand 
transform. 

Before the middle forties, the interest of functional analysts was 
focused almost exclusively on normed spaces. The first major paper on the 
general theory of locally convex spaces is that of J. Dieudonné and L. 
Schwartz in Ann. Inst. Fourier (Grenoble), vol. 1, pp. 61-101, 1949. One of 


' Banach is obviously one of the major heroes of this story. The preceding remarks are not 
intended to be in any way derogatory (as some readers of the first edition thought) or to 
belittle the importance and originality of his work. Their intent is merely to contrast our 
present mathematical environment with what it was then. 
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its principal motivations was Schwartz’ construction of the theory of dis- 
tributions [26]. (The first version of this book appeared in 1950.) Just as 
Banach and Gelfand had predecessors, so did Schwartz. As Bochner points 
out in his review of Schwartz’ book (Bull. Amer. Math. Soc., vol. 58, pp. 
78-85, 1952), the idea of “ generalized functions” goes back at least as far as 
Riemann. It was applied in Bochner’s Vorlesungen tuber Fouriersche Inte- 
grale (Leipzig, 1932), a book that played a very important role in the devel- 
opment of harmonic analysis. Sobolev’s work also predates Schwartz. But it 
was Schwartz who built all this into a smoothly operating very general 
structure that turned out to have many applications, especially to partial 
differential equations. 

The following expository articles describe some of the history of our 
subject in greater detail. 


Bonsall, F. F.: “A Survey of Banach Algebra Theory,” Bull. London Math. 
Soc., vol. 2, pp. 257-274, 1970. 

Hildebrandt, T. H.: “Integration in Abstract Spaces,” Bull. Amer. Math. 
Soc., vol. 59, pp. 111-139, 1953. 

Horvath, J.: “An Introduction to Distributions,” Amer. Math. Monthly, vol. 
77, pp. 227-240, 1970. 

Lorch, E. R.: “The Structure of Normed Abelian Rings,” Bull. Amer. Math. 
Soc., vol. 50, pp. 447-463, 1944. 

Taylor, A. E.: “Notes on the History and Uses of Analyticity in Operator 
Theory,” Amer. Math. Monthly, vol. 78, pp. 331-342, 1971. 

Tréves, F.: “Applications of Distributions to PDE Theory,” Amer. Math. 
Monthly, vol. 77, pp. 241-248, 1970. 


Volume 1 of the series Studies in Mathematics (published by the Math- 
ematical Association of America, 1962, edited by R. C. Buck) contains arti- 
cles by 


Goffman, C.: “ Preliminaries to Functional Analysis ” 

Lorch, E. R.: “The Spectral Theorem ” 

McShane, E. J.: “A Theory of Limits” 

Stone, M. H.: “A Generalized Weierstrass Approximation Theorem ” 


There are two special issues of Bull. Amer. Math. Soc.: One (May 
1958) is devoted to the work of John von Neumann; the other (January 
1966) to that of Norbert Wiener. 

The origins of functional analysis are well described in Dieudonné’s 
book [36]. 

We now give detailed references to some items in the text. 
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Chapter 1 


For the general theory of topological vector spaces, see [5], [14], [15], 
[31], [32]. 

Section 1.8 (e). In Banach’s definition of an F-space, he postulated 
only the separate continuity of scalar multiplication and proved that joint 
continuity was a consequence. See [4], pp. 51-53, for a proof based on 
Baire’s theorem. Another proof (due to S. Kakutani) does not require com- 
pleteness of X but uses Lebesgue measure in the scalar field; see [33], 
pp. 31-32. . 

Theorem 1.24. This metrization theorem was first proved (in the more 
general context of topological groups) by G. Birkhoff (Compositio Math., 
vol. 3, pp. 427-430, 1936) and by S. Kakutani (Proc. Imp. Acad. Tokyo, vol. 
12, pp. 128-142, 1936). Part (d) of the theorem is perhaps new. 

Section 1.33. The Minkowski functional of a convex set is sometimes 
called its support function. 

Theorem 1.39 is due to A. Kolmogorov (Studia Math., vol. 5, pp. 
29-33, 1934). It may well be the first theorem about locally convex spaces. 

Section 1.46. The construction of the function g by repeated averag- 
ing may be found on pp. 80-84 of S. Mandelbrojt’s 1942 Rice Institute 
Pamphlet “Analytic Functions and Classes of Infinitely Differentiable Func- 
tions,” where it is credited to H. E. Bray. 

Section 1.47. Of particular interest among the F-spaces that are not 
locally convex but have enough continuous linear functionals to separate 
points are certain subspaces of I?, the H?-spaces (with 0 < p < 1). For a 
detailed study of these, see the paper by P. L. Duren, B. W. Romberg, and 
A. L. Shields in J. Reine Angew. Math., vol. 238, pp. 32-60, 1969, and those 
by Duren and Shields in Trans. Amer. Math. Soc., vol. 141, pp. 255-262, 
1969, and in Pac. J. Math., vol. 32, pp. 69-78, 1970, as well as [40]. 


Chapter 2 


Basically, all results of this chapter are in [2]. 

Exercise 11. Charles Horowitz constructed a bilinear map from 
R° x R® onto R* which is not open at (0, 0), in Proc. Amer. Math. Soc., vol. 
53, pp. 293-294, 1975. P. J. Cohen (J. Func. Anal., vol. 16, pp. 235-239, 
1974) had earlier constructed a much more complicated example, mapping 
¢* x ¢' onto 7’. 

Exercise 13. A barrel is a closed, convex, balanced, absorbing set. A 
space is barreled if every barrel contains a neighborhood of 0. Exercise 13 
asserts: Topological vector spaces of the second category are barreled. 
There exist barreled spaces of the first category, and certain versions of the 
Banach-Steinhaus theorem are valid for them. See [14], p. 104; also [15]. 


NOTES AND COMMENTS 401 


Barreled spaces with the Heine-Borel property are often called Montel 
spaces; see Sec. 1.45. 


Chapter 3 


Theorem 3.2 is in [2]. Its complex version, Theorem 3.3, was proved by 
H. F. Bohnenblust and A. Sobczyk, Bull. Amer. Math. Soc., vol. 44, pp. 
91-93, 1938, and by G. A. Soukhomlinoff, Mat. Sbornik, vol. 3, pp. 353-358, 
1938. The latter also considered quaternion scalars. In Proc. Amer. Math. 
Soc., vol. 50, pp. 322-327, 1975, J. A. R. Holbrook presents a proof in which 
real scalars are not treated separately, and he includes a simplified version 
of Nachbin’s work on Hahn-Banach extensions of linear transformations 
(in place of linear functionals); see Trans. Amer. Math. Soc., vol. 68, pp. 
2846, 1950. 

Theorem 3.6. For a partial converse, see J. H. Shapiro, Duke Math. J., 
vol. 37, pp. 639-645, 1970. 

Theorem 3.15. See L. Alaoglu, Ann. Math., vol. 41, pp. 252—267, 1940. 
For separable Banach spaces, the theorem is in [2], p. 123. 

Theorem 3.18. A shorter proof, based on seminorms, may be found 
on p. 223 of [32]. 

Section 3.22. Compact convex sets with no extreme point exist in 
some F-spaces. See [40]. 

Theorem 3.23 was proved, for weak*-compact convex subsets of the 
dual of a Banach space, by M. Krein and D. Milman, in Studia Math., vol. 
9, pp. 133-1940. 

Theorem 3.25 appeared in Dokl. Akad. Nauk SSSR, vol. 57, pp. 119- 
122, 1947. 

The history of vector-valued integration is described by T. H. Hilde- 
brandt in Bull. Amer. Math. Soc., vol. 59, pp. 111-139, 1953. The “weak” 
integral of Definition 3.26 was developed by B. J. Pettis, Trans. Amer. Math. 
Soc., vol. 44, pp. 277-304, 1938. 

The history of vector-valued holomorphic functions is described by 
A. E. Taylor in Amer. Math. Monthly, vol. 78, pp. 331-342, 1971. 

Theorem 3.31. That weakly holomorphic functions (with values in 
a complex Banach space) are strongly holomorphic was proved by 
N. Dunford in Trans. Amer. Math. Soc., vol. 44, pp. 304-356, 1938. 

Theorem 3.32 was used by A. E. Taylor to prove that the spectrum of 
every bounded linear operator on a complex Banach space is nonempty 
(Bull. Amer. Math. Soc., vol. 44, pp. 70-74, 1938). Since every Banach 
algebra A is isomorphic to a subalgebra of @(A) (see the proof of Theorem 
10.2), Taylor’s result contains (a) of Theorem 10.13. 

Exercise 9 is due to von Neumann, Math. Ann., vol. 102, pp. 370-427, 
1930; see p. 380. 
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Exercise 10 is patterned after a construction in the appendix of [2]. 

Exercise 25. If K is also separable and metric, then such a yp exists 
even on E, rather than on E. This is Choquet’s theorem. See [20]. For a 
recent paper on this, see R. D. Bourgin, Trans. Amer. Math. Soc., vol. 154, 
pp. 323-340, 1971. 

Exercise 28 (c). This is the easy part of the Eberlein-Smulian theorem. 
See [4], pp. 430-433 and p. 466. Another characterization of weak com- 
pactness has been given by R. C. James, Trans. Amer. Math. Soc., vol. 113, 
pp. 129-140, 1964: A weakly closed set S in a Banach space X is weakly 
compact if and only if every x* € X* attains its supremum on S. 

Exercise 33. See [14], p. 133. 


Chapter 4 


A large part of this chapter is in [2]. 

Compact operators used to be called completely continuous. As defined 
by Hilbert (in 77) this means that weakly convergent sequences are mapped 
to strongly convergent ones. The presently used definition was given by 
F. Riesz (Acta Math., vol. 41, pp. 71-98, 1918). In reflexive spaces, the two 
definitions coincide (Exercise 18). 

Section 4.5. R. C. James has constructed a nonreflexive Banach space 
X which is isometrically isomorphic with X** (Proc. Natl. Acad. Sci. USA, 
vol. 37, pp. 174-177, 1951). 

Theorems 4.19 and 4.25 were proved by J. Schauder (Studia Math., 
vol. 2, pp. 183-196, 1930). For generalizations to arbitrary topological 
vector spaces, see J. H. Williamson, J. London Math. Soc., vol. 29, pp. 149- 
156, 1954; also [5], chap. 9. 

Exercise 13. It was a problem of long-standing whether every 
compact operator in every separable Banach space can be approximated (in 
the operator norm) by operators with finite-dimensional ranges. The first 
counterexample was constructed by P. Enflo, in Acta Math., vol. 130, pp. 
309-317, 1973. (This also gave a negative solution to the so-called basis 
problem.) Ramifications of this approximation problem are discussed in 
[41]. 

Exercise 15. These operators are usually called Hilbert-Schmidt oper- 
ators. See [4], chap. XI. 

Exercise 17. Operators of this type are discussed by A. Brown, P. R. 
Halmos, and A. L. Shields in Acta Sci. Math. Szeged., vol. 26, pp. 125-137, 
1965. 

Exercise 19. This “max-min duality” was exploited by W. W. Rogo- 
sinski and H. S. Shapiro to obtain very detailed information about certain 
extremum problems for holomorphic functions. See Acta Math., vol. 90, pp. 
287-318, 1953. 
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Exercise 21. This was proved by M. Krein and V. Smulian in Ann. 
Math., vol. 41, pp. 556-583, 1940. See also [4], pp. 427-429. 


Chapter 5 


Theorem 5.1. For a more general version, see R. E. Edwards, J. London 
Math. Soc., vol. 32, pp. 499-501, 1957. 

Theorem 5.2 is due to A. Grothendieck, Can. J. Math., vol. 6, pp. 
158-160, 1954. His proof is less elementary than the one given here. 

Theorem 5.3. For more on trigonometric series with gaps, see J. 
Math. Mech., vol. 9, pp. 203-228, 1960; also, sec. 5.7 of [24], J. P. Kahane’s 
article in Bull. Amer. Math. Soc., vol. 70, pp. 199-213, 1964, and [42]. 

Theorem 5.5 was first proved by A. Liapounoff, Bull. Acad. Sci. USSR, 
vol. 4, pp. 465-478, 1940. The proof of the text is due to J. Lindenstrauss, J. 
Math. Mech., vol. 15, pp. 971-972, 1966. J. J. Uhl (Proc. Amer. Math. Soc., 
vol. 23, pp. 158-163, 1969) generalized the theorem to measures whose 
values lie in a reflexive Banach space or in a separable dual space. 

Theorem 5.7. The idea to use Krein-Milman to prove Stone- 
Weierstrass is due to L. de Branges, Proc. Amer. Math. Soc., vol. 10, pp. 
822-824, 1959. E. Bishop’s generalization is in Pac. J. Math., vol. 11, pp. 
777-783, 1961. The proof given here is that of I. Glicksberg, Trans. Amer. 
Math. Soc., vol. 105, pp. 415-435, 1962. C. Hamburger pointed out to me 
that one does not need to assume that A contains the constants. A very 
elementary approach to Bishop’s theorem was found by Mao Chao-Lin, C. 
R. Acad. Sci. Paris, vol. 301, pp. 349-350, 1985. 

Theorem 5.9. Bishop proved this in Proc. Amer. Math. Soc., vol. 13, 
pp. 140-143, 1962. For the special case of the disc algebra, see Proc. Amer. 
Math. Soc., vol. 7, pp. 808-811, 1956, and L. Carleson’s paper in Math Z., 
vol. 66, pp. 447-451, 1957. Other applications occur in Chapter 6 of [25] 
and in Chapter 10 of [45]. See also [29]. 

Theorem 5.10. The proof follows that of M. Heins, Ann. Math., vol. 
52, pp. 568-573, 1950, where the same method is applied to a large class of 
interpolation problems. 

Theorem 5.11 was proved by S. Kakutani in Proc. Imp. Acad. Tokyo, 
vol. 14, pp. 242-245, 1938. The proof given here was communicated to me 
by Isaac Namioka and is due to F. Hahn, Math. Systems Theory, vol. 1, pp. 
55-57, 1968. The lemma avoids the use of nets and subnets at the end of the 
proof. 

Theorem 5.14. This simple construction of the Haar measure of a 
compact group is essentially that of von Neumann (Compositio Math., vol. 
1, pp. 106-114, 1934). His is even more elementary and self-contained, 
though a little longer, since he uses no fixed point theorem. (In Trans. 
Amer. Math. Soc., vol. 36, pp. 445-492, 1934, he uses the same method to 
construct mean values of almost periodic functions.) If compactness is 
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replaced by local compactness, the construction of Haar measure becomes 
more difficult. See [18], [11], [16]. 

Theorem 5.18 was proved (for Banach spaces) in Proc. Amer. Math. 
Soc., vol. 13, pp. 429-432, 1962. For further results on uncomplemented 
subspaces, see H. P. Rosenthal’s 1966 AMS Memoir Projections onto 
Translation-Invariant Subspaces of I?(G) and his paper in Acta Math., vol. 
124, pp. 205-248, 1970. There are also positive results. For example co is 
complemented in any separable Banach space which contains it 
(isomorphically) as a closed subspace. A very short proof of this theorem of 
A. Sobczyk was obtained by W. A. Veech in Proc. Amer. Math. Soc., vol. 28, 
pp. 627-628, 1971. 

Section 5.19. That H' is uncomplemented in L’ was first proved by 
D. J. Newman, Proc. Amer. Math. Soc., vol. 12, pp. 98-99, 1961. The proof 
given here is in Proc. Amer. Math. Soc., vol. 13, pp. 429-432, 1962. 

Theorem 5.21. F. F. Bonsall’s paper in Quart. J. Math. Oxford, vol. 
37, pp. 129-136, 1986, contains this and further applications of Theorem 
D:22: 

Theorems 5.23, 5.28. The history of these fixed point theorems is 
described on pp. 470-471 of [4]. A proof of Brouwer’s theorem that is both 
elementary and simple may be found on pp. 38-40 of Dimension Theory by 
Hurewicz and Wallman, Princeton University Press, Princeton, N.J., 1948. 


Chapter 6 


The standard reference is, of course, [26]. See also [5], [8], [27], [31]. [13] 
contains a very concise introduction to the subject. 

Definition 6.3. BQ) is here topologized as the inductive limit of the 
Fréchet spaces 9,(Q). See [15], pp. 217-225, for a systematic discussion of 
this notion in an abstract setting. 


Chapter 7 


For those aspects of Fourier analysis that are related to distributions, we 
refer to [26] and [13]. The group-theoretic aspects of the subject are dis- 
cussed in [11] and [24]. The standard work on Fourier series is [34]. 

Theorem 7.4. The intimate relation between Fourier transforms and 
differentiation is no accident; Fourier series were invented, in the eighteenth 
century, as tools to solve differential equations. 

Theorem 7.5 is sometimes called the Riemann-Lebesgue lemma. 

Theorem 7.9 was originally proved by M. Plancherel in Rend. 
Palermo., vol. 30, pp. 289-335, 1910. 
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Theorems 7.22 and 7.23. These proofs are as in [13] but contain more 
details. 

Theorem 7.25 is due to S. L. Sobolev, Mat. Sbornik, vol. 4, pp. 471- 
497, 1938. 

Exercise 16. This is taken from L. Schwartz’ first counterexample to 
the spectral synthesis problem (C. R. Acad. Sci. Paris, vol. 227, pp. 424—426, 
1948). For further information on this problem, see C. S. Herz (Trans. 
Amer. Math. Soc., vol. 94, pp. 181-232, 1960) and chap. 7 of [24]. 

Exercise 17. See C. S. Herz, Ann. Math., vol. 68, pp. 709-712, 1958. 


Chapter 8 


General references: [1], [13], [27], [30]. 

The existence of fundamental solutions (Theorem 8.5) was established 
independently by L. Ehrenpreis (Amer. J. Math., vol. 76, pp. 883-903, 1954) 
and by B. Malgrange in his thesis (Ann. Inst. Fourier, vol. 6, pp. 271-355, 
1955-1956). Lemma 8.3 is Malgrange’s. He proves it for Fourier transforms 
f of test functions. He integrates over a ball where we have used a torus. As 
far as applications are concerned, this makes hardly any difference. The 
point is to get some useful majorization of f by fP, that is, to have division 
by P under control. Ehrenpreis solved this division problem in a different 
way and went on to solve more general division problems of this type. See 
[13] and [30] for further references and more detailed results. 

It is essential in Theorem 8.5 that the coefficients of the differential 
operator under consideration be constant. This follows from an equation 
constructed by H. Lewy (Ann. Math., vol. 66, pp. 155-158, 1957), which has 
C’ coefficients but no solution. HOrmander ([13], chap. VI) has investigated 
this nonexistence phenomenon very completely. 

Section 8.8. Many other types of Sobolev spaces have been studied. 
See [13], chap. II. 

Theorem 8.12. See K. O. Friedrichs, Comm. Pure Appl. Math., vol. 6, 
pp. 299-325, 1953, and P. D. Lax, Comm. Pure Appl. Math., vol. 8, pp. 
615-633, 1955. Lax treats the periodic case first, via Fourier series, and then 
uses the bootstrap proposition to obtain the general case. He does not 
assume that the highest-order terms are constant. See also [4], pp. 1703- 
1708. 

Exercise 10. G is the so-called “ Green’s function” of P(D). 

Exercise 16. This is a theorem about zero sets of homogeneous poly- 
nomials (with complex coefficients) in R”. See [1], p. 46. 


Chapter 9 


Section 9.1. See A. Tauber, Monatsh. Math., vol. 8, pp. 273-277, 1897, and 
J. E. Littlewood, Proc. London Math. Soc., vol. 9, pp. 434448, 1910. 
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Theorem 9.3. The use of distributions in this proof is as in J. 
Korevaar’s paper in Proc. Amer. Math. Soc., vol. 16, pp. 353-355, 1965. 

Theorem 9.4 to Theorem 9.7. N. Wiener, Ann. Math., vol. 33, pp. 
1-100, 1932, and H. R. Pitt, Proc. London Math. Soc., vol. 44, pp. 243-288, 
1938. Later proofs gave various generalizations; see [24], p. 159, for further 
references. See also A. Beurling, Acta Math., vol. 77, pp..127—136, 1945. 

Section 9.9. The prime number theorem was first proved, indepen- 
dently, by J. Hadamard (Bull. Soc. Math. France, vol. 24, pp. 199-220, 1896) 
and by Ch. J. de la Vallée-Poussin (Ann. Soc. Sci. Bruxelles, vol. 20, pp. 
183-256, 1896). Both used complex variable methods. Wiener gave the first 
tauberian proof, as an application of his general theorem. “Elementary ” 
proofs were found in 1949 by A. Selberg and by P. Erdos. For a simpler 
elementary proof, see N. Levinson, Amer. Math. Monthly, vol. 76, pp. 225- 
245, 1969. The complex variable proofs still give the best error estimates; 
see W. J. Le Veque, Topics in Number Theory, vol. I, p. 251, Addison- 
Wesley Publishing Company, Reading, Mass., 1956. 

Theorem 9.12. A. E. Ingham, J. London Math. Soc., vol. 20, pp. 171- 
180, 1945. 

The material on the renewal equation is from S. Karlin, Pac. J. Math., 
vol. 5, pp. 229-257, 1955, where references to earlier work may be found. 
Nonlinear versions of the renewal equation are discussed by J. Chover and 
P. Ney in J. d’ Analyse Math., vol. 21, pp. 381-413, 1968; see also B. Henry, 
Duke Math. J., vol. 36, pp. 547-558, 1969. 

Exercise 7. This approximation problem is much less delicate in L’. 
See [23], sec. 9.16. 


Chapter 10 


General references: [7], [12], [16], [19], [21]. In [16] and [21], a great 
deal of basic theory is developed without assuming the presence of a unit. 
[21] contains some material about real algebras. 

Gelfand’s paper (Mat. Sbornik, vol. 9, pp. 3-24, 1941) contains Theo- 
rems 10.2, 10.13, and 10.14, some symbolic calculus, and Theorem 11.9. For 
Fourier transforms of measures, the spectral radius formula (b) of Theorem 
10.13 had been obtained earlier by A. Beurling (Proc. 1X Congres de Math. 
Scandinaves, Helsingfors, pp. 345-366, 1938). See also the note to Theorem 
3.32. 

Theorem 10.9. The commutative case was obtained independently by 
A. M. Gleason (J. Anal. Math., vol. 19, pp. 171-172, 1967) and by J. P. 
Kahane and W. Zelazko (Studia Math., vol. 29, pp. 339-343, 1968). W. 
Zelazko (Studia Math., vol. 30, pp. 83-85, 1968) removed the commutativity 
hypothesis. The proof given in the text contains some simplifications. See 
also Theorem 1.4.4 of [3], and J. A. Siddiqi, Can. Math. Bull., vol. 13, pp. 
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219-220, 1970. M. Roitman and Y. Sternfeld (Trans. Amer. Math. Soc., vol. 
267, pp. 111-124, 1981) found a more algebraic proof, which uses no func- 
tion theory. Related results concerning ideals of finite codimension were 
found by N. V. Rao (J. Func. Anal., vol. 82, pp. 237-258, 1989). 

Theorem 10.19. H. A. Seid (Amer. Math. Monthly, vol. 77, pp. 282- 
283, 1970) obtains the same conclusions, without assuming that A has a 
unit, if M = 1. 

Theorem 10.20 says that o(x) is an upper semicontinuous function of 
x. An example of Kakutani ([21], p. 282) shows that o(z) is not, in general, a 
continuous function of x. See also Exercise 20. 

Section 10.21. The terms operational calculus or functional calculus are 
also frequently used. [12] contains a very thorough treatment of the sym- 
bolic calculus in Banach algebras. 

Theorem 10.34 (d) is due to E. R. Lorch (Trans. Amer. Math. Soc., vol. 
52, pp. 238-248, 1942). 

Theorem 10.35. Lomonosov’s proof was published in Func. Anal. and 
Appl., vol. 7, pp. 55-56, 1973. Even for a single operator it is much simpler 
and more far-reaching than anything that was known before. A. J. Michaels 
gave an account of Hilden’s contribution in Adv. in Math., vol. 25, pp. 
56-58, 1977. 

As regards earlier work, N. Aronszajn and K. T. Smith (Ann. Math., 
vol. 60, pp. 345-350, 1954) proved that every compact operator on a 
Banach space has a proper invariant subspace. A. R. Bernstein and A. 
Robinson (Pac. J. Math., vol. 16, pp. 421-431, 1966) obtained the same 
conclusion for bounded operators T on a Hilbert space that have p(T) 
compact for some polynomial p. Their proof uses nonstandard analysis; 
P. R. Halmos converted it into one that uses only classical concepts (Pac. J. 
Math., vol. 16, pp. 433-437, 1966). 

Since some operators, even on a Hilbert space, commute with no 
compact one (Exercise 26), Lomonosov’s theorem does not settle the invari- 
ant subspace problem. In fact, operators without invariant subspaces have 
been found in certain nonreflexive Banach spaces (P. Enflo, Acta Math., vol. 
158, pp. 213-313, 1987), and even in 7’ and c, (C. J. Read, Proc. London 
Math. Soc., vol. 53, pp. 583-607, 1989). See also Section 12.27. 

Exercise 22. This is one of the simplest cases of the Arens-Royden 
theorem for commutative Banach algebras. It relates the group G/G, to the 
topological structure of the maximal ideal space of A. See H. L. Royden’s 
article in Bull. Amer. Math. Soc., vol. 69, pp. 281-298, 1963, that by R. 
Arens in F. T. Birtel, ed., Function Algebras, pp. 164-168, Scott, Foresman 
and Company, Glenview, IIl., 1966, and [6] and [29]. 

Exercise 23. For the precise structure of G/G, in this case, see J. L. 
Taylor, Acta Math., vol. 126, pp. 195-225, 1971. 

Exercise 24. See C. Le Page, C. R. Acad. Sci. Paris, vol. 265, pp. 
A235—A237, 1967. 
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Exercise 26. The invariant subspaces of this shift operator are com- 
pletely known. This is Beurling’s theorem (Acta Math., vol. 81, pp. 239-255, 
1949). Helson and Lowdenslager (Acta Math., vol. 99, pp. 165-202, 1958) 
used different methods and extended Beurling’s theorem to other settings. 


Chapter 11 


Theorem 11.7. The case n = 1 was proved in elementary fashion by P. J. 
Cohen in Proc. Amer. Math. Soc., vol. 12, pp. 159-163, 1961. For n > 1, the 
proof of the text seems to be the only one that is known. 

Theorem 11.9. When A has no unit, then A is locally compact (but 
not compact) and A < C,(A); the origin of A* is then in the closure of A. 
See [16], pp. 52-53. 

Theorem 11.10 is what has been called an “automatic continuity ” 
theorem. (Theorems 10.7 and 11.31 are other examples.) This is a concept 
which brings classical analysis into contact with axiomatic set theory. For 
example, “Kaplansky’s problem” is the following: Is it true, for every 
compact Hausdorff space X and every Banach algebra A, that every homo- 
morphism from C(X) into A is continuous? The work of Dales, Esterle, 
Solovay, and Woodin has shown that this question is undecidable in ZFC 
(Zermelo-Frankel set theory plus the axiom of choice). See [38] for details. 

Example 11.13 (d) shows why there are very close relations between 
commutative Banach algebras, on the one hand, and holomorphic functions 
of several complex variables on the other. This topic is not at all pursued in 
the present book. Very good, up-to-date accounts of it may be found in the 
books by Browder [3]. Gamelin [6], Stout [29], and Wermer [47]. A sym- 
bolic calculus for functions of several Banach algebra elements can be 
developed. See R. Arens and A. P. Calderon, Ann. Math., vol. 62, pp. 204— 
216, 1955, and J. L. Taylor, Acta Math., vol. 125, pp. 1-38, 1970. 

Example 11.13 (e) shows why certain parts of Fourier analysis may be 
derived easily from the theory of Banach algebras. This is done in [16] and 
[24]. 

Theorem 11.18 was proved by Gelfand and Naimark in Mat. Sbornik, 
vol. 12, pp. 197-213, 1943. In the same paper they also proved that every 
B*-algebra A (commutative or not) is isometrically *-isomorphic to an 
algebra of bounded operators on some Hilbert space (Theorem 12.41), if 
e+ x*x is invertible for every x € A. That this additional hypothesis is 
redundant was proved 15 years later by I. Kaplansky [(/) of Theorem 
11.28]. See [21], p. 248, for references to the rather tangled history of this 
theorem. B. J. Glickfeld (Ill. J. Math., vol. 10, pp. 547-556, 1966) has shown 
that A is a B*-algebra if ||exp (ix)|| = 1 for every hermitian x € A. 

Theorem 11.20. The idea to pass from A to A/R, in order to prove the 
theorem without assuming the involution to be continuous, is due to J. W. 
M. Ford (J. London Math. Soc., vol. 42, pp. 521-522, 1967). 
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Theorem 11.23. See R. S. Foguel, Ark. Mat., vol. 3, pp. 449-461, 1957. 

Theorem 11.25. See P. Civin and B. Yood, Pac. J. Math., vol. 9, pp. 
415-436, 1959; especially p. 420. Also [21], p. 182. 

Theorem 11.28. A recent treatment of these matters was given by V. 
Ptak, Bull. London Math. Soc., vol. 2, pp. 327-334, 1970. Also, see the note 
to Theorem 11.18. 

Theorem 11.31. See [19], [21]. H. F. Bohnenblust and S. Karlin (Ann. 
Math., vol. 62, pp. 217-229, 1955) have found relations between positive 
functionals, on the one hand, and the geometry of the unit ball of a Banach 
algebra on the other. 

Theorem 11.32. See [7]. Also [16], p. 97, and [21], p. 230. 

Theorem 11.33 is in [20], for continuous involutions. 

Exercise 13. Part (g) contradicts the second half of corollary (4.5.3) in 
[21]. It also affects Theorem (4.8.16) of [21]. 

Exercise 14. This was first proved by S. Bochner (Math. Ann., vol. 
108, pp. 378-410, 1933; especially p. 407), using essentially the same 
machinery that we used in Theorem 7.7. See [24] for a somewhat different 
proof. The proof that is suggested here shows that the presence or absence 
of a unit element makes a difference in studying positive functionals. See 
[16], p. 96, and [21], p. 219. 


Chapter 12 


General references: [4], [9], [10], [17], [22]. 

Theorem 12.16. B. Fuglede proved the case M=WN in Proc. Natl. 
Acad. Sci. USA, vol. 36, pp. 35-40, 1950, including the unbounded case 
(Chapter 13, Exercise 15). His proof used the spectral theorem and was 
extended to the case M # N by C. R. Putnam (Amer. J. Math., vol. 73, pp. 
357-362, 1951), who also obtained Theorem 12.36. The short proof of the 
text is due to M. Rosenblum, J. London Math. Soc., vol. 33, pp. 376-377, 
1958. 

Theorem 12.22. The extension process that is used here to go from 
continuous functions to bounded ones is as in [16], pp. 93-94. 

Theorem 12.23. See [4], pp. 926-936, for historical remarks about the 
spectral theorem. See also P. R. Halmos’ article in Amer. Math. Monthly, 
vol. 70, pp. 241-247, 1963, for a different description of the spectral 
theorem. 

Theorem 12.38 was proved by P. R. Halmos, G. Lumer, and J. 
Schaffer, in Proc. Amer. Math. Soc., vol. 4, pp. 142-149, 1953. D. Deckard 
and C. Pearcy (Acta Sci., Math. Szeged., vol. 28, pp. 1-7, 1967) went further 
and proved that the range of the exponential function is neither open nor 
closed in the group of invertible operators. Their paper contains several 
references to intermediate results. 

Theorem 12.39. See [21], p. 227. 
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Theorem 12.41. Closed *-subalgebras of @(H) are called C*-algebras. 
Before Theorem 12.41 was known (see the note to Theorem 11.18), 
B*-algebras were studied separately, but now the term B*-algebra is no 
longer used much. 

Theorems 12.43, 12.44. Several types of ergodic theorems are dis- 
cussed in [4] and [43]. 

Exercise 2 is very familiar if N = 4. 

Exercise 18. The relation between shift operators and the invariant 
subspace problem is discussed by P. R. Halmos in J. Reine Angew. Math., 
vol. 208, pp. 102-112, 1961. 

Exercise 27. See P. Civin and B. Yood, Pac. J. Math., vol. 9, pp. 415- 
436, 1959, for many results about involutions. 

Exercise 32. Part (c) implies that every uniformly convex Banach 
space is reflexive. See Exercise 1 of Chapter 4 and the note to Exercise 28 of 
Chapter 3. All [?-spaces (with 1 < p < 0) are uniformly convex. See J. A. 
Clarkson, Trans. Amer. Math. Soc., vol. 40, pp. 396-414, 1936, or [15], pp. 
355-359. 


Chapter 13 


General references: [4], [12], [22]. 

Theorem 13.6 was first proved by A. Wintner, Phys. Rev., vol. 71, pp. 
738-739, 1947. The more algebraic proof of the text is H. Wielandt’s, Math. 
Ann., vol. 121, p. 21, 1949. It was generalized by D. C. Kleinecke (Proc. 
Amer. Math. Soc., vol. 8, pp. 535-536, 1957), to yield the following theorem 
about derivations: If D is a continuous linear operator in a Banach algebra 
A such that D(xy) = xDy + (Dx)y for all x, y € A, then the spectral radius of 
Dx is 0 for every x that commutes with Dx. This was also proved by Shiro- 
kov (Uspehi, vol. 11, no. 4, pp. 167-168, 1956) and, in commutative Banach 
algebras, by Singer and Wermer (Math. Ann., vol. 129, pp. 260-264, 1955). 
See p. 20 of I. Kaplansky’s article “Functional Analysis” in Some Aspects of 
Analysis and Probability, John Wiley & Sons, New York, 1958. 

A. Brown and C. Pearcy (Ann. Math., vol. 82, pp. 112-127, 1965) have 
proved, for separable H, that an operator T € @(H) is a commutator if and 
only if T is not of the form M + C, where A # 0 and C is compact. See also 
C. Schneeberger, Proc. Amer. Math. Soc., vol. 28, pp. 464—472, 1971. 

The Cayley transform, its relation to deficiency indices, and the proof 
of Theorem 13.30 are in von Neumann’s paper in Math. Ann., vol. 102, pp. 
49-131, 1929-1930, and so is the spectral theorem for normal unbounded 
operators. The material on graphs is in his paper in Ann. Math., vol. 33, pp. 
294-310, 1932. Our proof of Theorem 13.33 is like that of F. Riesz and E. R. 
Lorch, Trans. Amer. Math. Soc., vol. 39, pp. 331-340, 1936. See also [4], 
chap. XII. 


NOTES AND COMMENTS 4]1 


Definition 13.34. The continuity condition we impose can be 
weakened: if (a) and (b) hold, and if Q(t)x > x weakly, as t—0, for every 
x € X, then (c) holds. See [33], pp. 233-234. The proof uses more from the 
theory of vector-valued integration than the present book contains. 

Theorems 13.35-13.37 are proved in [4], [12], [22], [33], and [46]. 

Theorem 13.38. This is due to M. H. Stone, Ann. Math., vol. 33, pp. 
643-648, 1932; see also B. Sz. Nagy, Math. Ann. vol. 112, pp. 286-296, 1936. 

Exercise 25 was pointed out to me by Sheldon Axler. It corrects an 
error made on p. 341 of the first edition of this book. 


Appendix A 


Section A2. J. W. Alexander, Proc. Natl. Acad. Sci. USA, vol. 25, pp. 296— 
298, 1939. 

Section A3. A. Tychonoff proved this for cartesian products of inter- 
vals (Math. Ann., vol. 102, pp. 544-561, 1930) and used it to construct what 
is now known as the Cech (or Stone-Cech) compactification of a completely 
regular space. E. Cech (Ann. Math., vol. 38, pp. 823-844, 1937; especially 
p. 830) proved the general case of the theorem and studied properties of the 
compactification. Thus it appears that Cech proved the Tychonoff theorem, 
whereas Tychonoff found the Cech compactification—a good illustration of 
the historical reliability of mathematical nomenclature. 
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